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Introduction 

The  need  to  pursue  questions  of  causality  and  causal  inference  with 
nonexperimental  data  —  that  is,  data  based  on  naturally  occurring  events  — 
has  been  recognized  for  some  time  in  disciplines  such  as  biometrics, 
econometrics,  and  sociology  (cf.  Blalock,  1971;  Duncan,  1975;  Goldberger  & 
Duncan,  1973;  Heise,  1975;  Johnston,  1972;  Joreskog,  1970;  Namboodiri,  Carter, 
&  Blalock,  1975;  Wright,  1934,  1960).  More  importantly,  a  family  of  empirical 
procedures  designed  to  evaluate  the  utility  of  causal  hypotheses  and  to 
support  inferences  regarding  causality  among  naturally  occurring  events  is 
gaining  rapid  exposure  in  psychology  (cf.  Bentler,  1980;  Bentler  &  Bonett, 
1980;  Bentler  &  Weeks,  1980;  Cook  &  Campbell,  1979;  James  &  Singh,  1978; 
Joreskog,  1978;  Joreskog  &  Sorbom,  1979;  Kenny,  1979;  Maruyama  &  McGarvey, 
1980;  Werts  &  Linn,  1970).  The  term  "confirmatory  analysis"  is  used  here  to 
refer  to  this  family  of  procedures,  which  includes  confirmatory  factor 
analysis,  linear  structural  relations,  path  analysis,  structural  equations, 
and  time  series.  The  term  "confirmatory”  denotes  that  these  procedures  are 
designed  to  evaluate  the  utility  of  causal  hypotheses  by  testing  the  fit 
between  a  theoretical  model  and  empirical  data.  If  a  theoretical  model  is 
shown  to  have  a  "good  fit"  with  the  data,  then  the  model  is  regarded  as 
confirmed.  Conversely,  a  theoretical  model  is  discon firmed  if  it  has  a  "poor 
fit"  with  the  data. 

Confirmatory  analysis  is  likely  to  assume  an  important  place  in  the 
repertoire  of  psychological  research  methods.  Bentler  (1980,  p.  420),  for 
example,  described  linear  structural  equation  models  with  latent  (i.e., 
unmeasured)  variables  as  having  "...the  greatest  promise  for  furthering 


psychological  science."  Another  example  is  provided  by  the  current  editor  of 
Psychometrika  (Cliff,  1980)  who  stated,  "The  development  of  the  rigorous  and 
generalized  methods  for  testing  hypotheses  concerning  underlying  structures  in 
covariance  matrices  is  perhaps  the  most  important  and  influential  statistical 
revolution  to  have  occurred  in  the  social  sciences."  However,  there  have  also 
been  words  of  caution  about  the  strong  emphasis  in  the  literature  on  the 
statistical  methods  of  confirmatory  analysis  and  the  comparatively  weak 
emphasis  on  the  assumptions  that  justify  the  use  of  these  methods.  Tb 
illustrate.  Cliff  (1980)  went  on  to  say  that  while  the  methods  may  be  "...a 
great  boon  to  social  science  research,  ...there  is  some  danger  that  they  may 
instead  have  been  a  disaster,  a  disaster  because  they  seem  to  encourage  us  to 
suspend  our  normal  critical  faculties."  Billings  and  Wroten  (1978)  and  James 
(1980)  vrere  less  equivocal,  noting  that  failure  to  pay  attention  to  potential 
immeasured  causes  in  path  analytic  models  has  resulted  in  biased  estimates  of 
path  coefficients  and  erroneous  causal  inferences. 

There  is  a  serious  need  to  specify  the  conditions  that  justify  the 
application  of  confirmatory  analysis  and  the  use  of  the  results  of 
confirmatory  analysis  to  support  causal  inference.  Our  objective  is  to 
specify  and  to  discuss  these  conditions.  Tb  meet  this  objective,  we  will  show 
how  these  conditions  follow  from  a  nimber  of  fundamental  philosophical 
assumptions  about  the  nature  of  causality  and  about  how  causes  may  be  known. 

We  vri.ll  begin  our  discussion  in  Section  1  with  an  overview  of  the 
philosophical  issues  surrounding  the  idea  of  causation.  This  section  is 
designed  to  help  the  reader  who  is  not  familiar  with  these  issues  see  how 
philosophical  questions  about  causation  must  be  settled  before  one  proceeds  to 
develop  a  methodolgy  for  dealing  vrith  causation  in  the  behavioral  and  social 
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sciences.  In  this  discussion  we  will  state  inhere  we  stand  philosophically  on 
these  issues,  recognizing  that  the  topic  of  causality  is  still  one  of  the  most 
controversial  in  the  philosophy  of  science.  We  will  also  introduce  an 
interpretation  of  causality  that  we  feel  corresponds  to  the  intuitive  use  of 
the  concept  by  scientists  (Simon,  1977),  and  then  use  this  interpretation  to 
establish  a  rationale  for  testing  the  utility  of  causal  hypotheses  by 
confirmatory  analysis. 

The  rationale  for  testing  the  utility  of  causal  hypotheses  by 
confirmatory  analysis  will  be  developed  in  greater  detail  in  Section  2  by 
overviewing  ten  conditions  vhich,  if  reasonably  satisfied,  justify 
confirmatory  analysis.  The  role  of  confirmatory  analysis  in  causal  inference 
will  be  addressed  in  Section  3,  where  we  will  discuss  the  advantages  and 
disadvantages  of  confirmatory  analysis  in  the  context  of  the  equivocality  of 
causal  inference.  Finally,  Section  4  will  be  devoted  to  an  overview  of  latent 
variable  models,  which  are  models  designed  to  evaluate  the  utility  of  causal 
hypotheses  among  theoretical  constricts.  As  discussed  in  greater  detail 
later,  "latent  variables"  are  abstractions  or  theoretical  constructs  that  are 
not  amenable  to  direct  measurement  (e.g.,  common  factors). 

The  presentation  is  relatively  nontechnical  inasmuch  as  our  goal  is  to 
discuss  logic  and  assumptions.  Nevertheless,  confirmatory  analysis  cannot  be 
addressed  independently  of  methodology.  Those  desiring  more  comprehensive 
treatments  of  methods  are  referred  to  the  references  cited  throughout  the 
paper.  Finally,  although  this  presentation  of  confirmatory  analysis  is 
limited  to  causal  inference  with  naturally  occurring  events,  we  also  affirm 
that  the  logic  and  procedures  extend  to  experimental  data  (cf.  Kenny,  1979; 
Miller,  1971). 
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1_.  AN  INTERPRETATION  OF  CAUSALITY 
Causality  is  a  complex  topic,  beset  by  controversy  because  of 
metaphysical  and  epistemological  differences  among  philosophers  of 
science.  Nevertheless,  we  believe  that  an  understanding  of  causality  is 
helpful  for  understanding  the  conditions  that  justify  confirmatory 
analysis  and  the  use  of  the  results  of  confirmatory  analysis  to  support 
causal  inference.  It  is  not  our  aim  to  explore  exhaustively  the  theories 
by  which  we  come  to  know  things  or  to  delve  deeply  into  the  metaphysical 
aspects  of  causation,  however  important  these  may  be  to  a  full 
understanding  of  causality.  Rather,  we  wish  to  develop  an  understanding 
of  causality  that  is  compatible  with  the  methodological  treatment  of 
causality  in  scientific  inquiry. 

Causality  and  Necessary  Connection 

Vtoen  we  seek  to  understand  an  event  or  an  object,  what  we  often  want 
to  know  is  what  caused  or  determined  it.  Thus,  at  first  view,  causality 
seems  to  involve  establishing  a  necessary  connection  in  the  form  of  an 
event  A  being  necessary  for  the  occurrence  of  an  event  B.  But  we 
shall  argue  that  causal  relations  are  not  necessary  relations.  Further, 
our  acceptance  of  a  causal  relation  must  be  tentative,  because  there  is  no 
logical  guarantee  that  the  relation  will  be  supported  by  further 
experience. 

The  perspective  that  causal  relations  are  necessary  relations  is 


exemplified  by  the  philosophical  realist  w*io  believes  in  the  independent 
existence  of  a  world  beyond  the  senses.  Causation,  for  the  philosophical 
realist,  concerns  the  manner  in  which  objects  are  able  to  effect  change  in 
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other  objects  by  means  of  various  forces  and  powers.  The  realist  wishes 
to  discover  these  forces  and  to  show  why  objects  necessarily  produce  the 
effects  that  they  do.  This  is  not,  however,  a  sufficient  account  of 
causation  for  the  psychologically  minded  v^o  wonder  how  it  is  that  we  know 
about  the  world  and  the  causes  that  objects  in  the  world  exert  on  other 
objects.  How  do  external  objects  cause  humans  to  perceive  not  just  the 
objects,  but  the  causal  forces  they  exert  on  one  another?  From  what 
information  does  the  hunan  organism  construct  internal  representations  of 
external  objects  and  causal  forces? 

We  know  that  information  about  the  world  comes  to  the  senses  in  the 
form  of  physical  energy.  Furthermore,  information  about  causes  appears  to 
reside  in  the  temporal  sequence  of  stimulus  information.  For  example, 
people  report  that  they  perceive  events  causing  other  events  as  they  watch 
a  film.  But  all  the  information  presented  in  the  film  is  varying 
intensities  of  light  and  shadow,  arranged  spatially  in  images  that  are 
presented  rapidly  in  a  sequence.  People  do  not  report  that  they  perceive 
events  causing  other  events  vhen  the  film's  image  frames  are  presented  in 
random  order.  The  perception  of  causation  depends  on  the  order  in  which 
the  information  is  presented  and  cannot  be  found  in  the  individual 
elements  of  information. 

The  British  empiricists  of  the  18th  and  19th  centuries,  reflecting  on 
the  problem  of  causation,  reached  a  similar  conclusion.  David  Hume 
(1977/1748)  argued  that  the  only  thing  he  could  find  in  experience  to 
account  for  the  idea  of  a  causal  connection  between  phenomenal  events  was 
an  awareness  of  the  repetition  of  similar  instances  in  which  an  object  of 
one  kind  is  invariably  followed  by  an  object  of  another  kind.  Nothing 
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within  the  objects  themselves  connect  them.  Only  the  repeated  pairing  of 
objects  gives  rise  to  the  idea  of  a  causal  relation  between  objects. 

"This  connexion,"  Hume  wrote,  "which  we  feel  in  the  mind,  this  customary 
transition  of  the  imagination  from  one  object  to  its  usual  attendant,  is 
the  sentiment  or  impression  from  which  ws  form  the  idea  of  power  or 
necessary  connexion.  Nothing  farther  is  in  the  case"  (Hume,  1977/1948,  p. 
50). 

Hume  particularly  ruled  out  any  logical  necessity,  in  the  form  an 
event  A  being  necessary  for  the  occurrence  of  an  event  B,  because  logical 
necessity  would  allow  for  neither  exceptions  nor  contradictions,  such  as 
the  occurrence  of  an  event  B  without  the  occurrence  of  an  event  A.  He 
argued  that  in  considering  connections  between  objects,  it  is  possible 
that  in  the  future  we  will  experience  the  connection  between  objects 
differently  than  we  have  in  the  past.  For  a  considerable  time  after  Hume, 
empiricism  discounted  any  role  for  logical  necessity  in  considerations  of 
matters  of  fact.  The  empiricists'  rejection  of  logical  necessity  led  them 
to  avoid  deduction  from  general  theories  and  to  stress  induction  and 
description. 

Modern  empiricism,  however,  restored  logic  to  respectability  in 
science  when  it  came  to  explanation  using  hypothetical  formal  models 
designed  to  represent  relations  between  objects  of  experience.  Logic 
entered  in  when  one  deduced  the  occurrence  of  specific  events  from  general 
laws  established  inductively  in  prior  experience.  To  illustrate,  Hemphill 
and  Oppenheim  (1948)  offered  the  following  schema  for  the  process  of 
explanation:  Let  E  be  an  event  to  be  explained,  described  in 
observational  terms.  Let  C^,  C^,  ...,  be  a  set  of  antecedent 


conditions.  Let  L.,  L-,  L  be  general  laws.  Then,  if  from 

_  £  “E 

the  inductively  generated  antecedent  conditions  and  the  general  laws  one 
is  able  to  deduce  the  event  El  to  be  explained,  these  antecedent 
conditions  are  the  explanation  of  E.  The  "logical  necessity"  that 
connects  the  antecedent  conditions  and  general  laws  is  not  absolute  for 
these  empiricists.  Rather,  this  "logical  necessity"  is  relative  to  the 
hypothetical  logical  system  in  vrfiich  one  assumes  that  the  antecedent 
conditions  and  general  laws  are  valid  premises.  Furthermore,  there  can  be 
no  absolute  necessity  for  such  logically  deduced  explanations  to  be  always 
correct,  for  further  experience  may  show  the  premises  on  which  they  are 
based  to  be  false. 

In  general,  inductively  based  theories  and  explanations — that  is, 
generalizations  from  the  past  to  the  present  or  future — cannot  escape  the 
tentativeness  of  inductive  inferences.  As  Hume  observed,  there  are  no 
logically  necessary  connections  in  experience.  Miether  the  necessary 
connections  of  causal  relations  are  established  by  repeatedly  observing 
invariant  successions  of  an  event  A  followed  by  an  event  B,  or  are 
deduced  logically  from  inductively  established  premises,  it  is  not 
necessary  that  one  will  continue  to  observe  such  regular  successions  or  to 
find  the  empirically  induced  premises  of  a  deductive  argument  to  be  true. 

Hume  even  came  to  question  whether  the  process  of  induction,  of 
generalizing  from  the  past  to  the  present  or  the  future,  was  rationally 
justifiable.  This  was  particularly  disturbing  to  him  because  he  saw  that 
people  used  inductive  reasoning  all  the  time  in  their  everyday  affairs.  But 
he  could  conceive  of  no  rational  argument  that  could  stand  up  to  the  logical 
possibility  that  events  could  turn  out  to  be  quite  different  from  what  was 
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inductively  expected.  Numerous  philosophers  after  Hume  have  similarly  been 
unable  to  find  rational  arguments  to  justify  the  use  of  induction. 

Nevertheless,  some  philosophers  seem  willing  to  live  with  the 
unjustifiability  of  expectations  based  on  past  experience.  Feigl  (1963) 
regarded  the  idea  that  past  experience  is  a  guide  to  v^iat  will  happen  in 
the  present  and  the  future  as  an  essential  but  rationally  unjustifiable 
premise  that  one  had  to  make  to  deal  rationally  with  experience.  Cook  and 
Campbell  (1979)  regarded  inductive,  cause- inferring  behavior  as 
characteristic  behavior  of  our  species  and  suggested  that  such  behavior 
evolved  because  it  had  survival  value.  Other  philosophers  have  not  been 
so  kind  in  dealing  with  ftme's  skepticism.  Wittgenstein  regarded  Hume's 
skepticism  about  induction  as  "unintelligible",  on  a  par  with  asking 
someone  to  justify  breathing  or  walking  (cf.  Dilman,  1973).  Capaldi 
(1969)  detected  a  subtle  circularity  in  Hume's  skeptical  argument,  namely 
that  Hume  must  rely  on  instances  in  past  experience  of  unfulfilled 
expectations  to  support  his  argument  that  past  experience  cannot  be  relied 
upon  always  to  fulfill  our  expectations.  But  if  past  experience  cannot  be 
relied  upon,  how  can  Hume  rely  upon  it  to  prove  that  it  cannot  be  relied 
upon?  Thus  he  presumed  the  very  thing  he  sought  to  undermine. 

In  sum,  causal  relations  are  not  necessary  relations.  Miile  a  causal 
relation  may  be  deduced  logically  from  prior  theoretical  assumptions  or  based 
inductively  on  an  observation  of  a  regularity  in  experience,  there  is  no 
logical  guarantee  that  the  relation  will  occur  similarly  in  future  experience. 
Thus,  our  use  of  causal  relations  must  be  tentative.  Furthermore,  if  we  do 
use  causal  relations  as  explanations  or  as  guides  for  action,  we  do  so  without 
any  rational  justification  for  why  this  practice  occurs.  It  is  basically  a 
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fundamental  human  practice. 

The  Form  of  a  Causal  Relation 

Some  philosophers  conceptualize  causal  relations  as  following  the  form 
of  logical  implication.  These  forms  are  reviewed  briefly  below.  We  will 
then  argue  that  causal  relations  need  not,  and  perhaps  should  not,  be  con¬ 
ceptualized  as  following  the  form  of  logical  implication.  Rather,  we  will 
argue  that  causal  relations  follow  the  form  of  an  asymmetric  functional 
relation  in  a  self-contained  system. 

In  regard  to  logical  implication,  causal  relations  have  often  been 
viewed  as  taking  one  of  the  following  three  interrelated  forms  (Byerly, 

1973):  (1)  Sufficient  condition:  An  event  A  is  causally  sufficient 

for  an  effect  B  if  B  occurs  whenever  A  occurs.  (2)  Necessary 
condition:  An  event  A  is  causally  necessary  for  an  effect  B  if  B 
never  occurs  without  A  having  already  occurred.  (3)  Necessary  and 
Sufficient  condition:  An  event  A  is  a  necessary  and  sufficient 
condition  for  an  effect  J3  if  B  never  occurs  vihen  A  has  not  occurred 
and  B  always  occurs  when  A  has  occurred. 

Treating  causation  as  having  one  of  the  forms  of  logical  implication 
leads  to  conclusions  vrtiich  are  incompatible  with  the  common  language 
conceptions  of  causation.  For  exanple,  the  statement,  "If  A,  then  B" 
logically  implies  "If  not  E3,  then  not  A."  But,  if  the  causal  relation 
follovs  the  form  of  logical  implication,  this  would  mean  "A  causes  B" 
implies  "not  B  causes  not  A",  which  makes  no  sense.  Tb  use  a  concrete 
example,  if  the  causal  relation  follows  the  form  of  logical  implication, 
then  the  statement,  "John's  pressing  the  accelerator  pedal  causes  the  car 
to  accelerate"  implies  "The  car's  not  accelerating  causes  John  not  to 
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press  the  accelerator  pedal."  Not  only  does  this  sound  absurd,  but  also 
f  it  points  out  the  need  for  an  approach  to  causality  that  "corresponds  to 

the  intuitive  use  of  that  term  in  scientific  discussion"  (Simon,  1977,  p. 
77). 

.  Simon  (1952,  1953,  1966,  1977)  argued  that,  formally  speaking,  the 

form  of  the  causal  relation  does  not  take  the  form  of  logical 
implication.  Rather  the  causal  relation  takes  the  form  of  an  asymmetric 
functional  relation  in  a  self-contained  or  closed  system.  These 
points  are  illustrated  in  Figure  1.1,  which  concerns  the  functional 
relation  between  an  effect  Y  and  a  cause  X.  As  shown  at  the  top  of 
Figure  1.1,  there  are  12  scores  on  each  variable.  The  variable  X  has 
six  possible  values  (i.e.,  1,...,6),  and  the  variable  Y  has  three 
possible  values  (i.e.,  1,2,3).  Each  separate  occurrence  of  an  X  or  a 
Y  is  an  event.  Thus,  this  example  has  24  events.  Each  value  of  X 
and  Y  defines  a  class  of  events.  For  example,  X  *  5  defines  a  class 
of  events  with  the  common  value  of  5,  and  there  are  two  events  in  the  X 
=  5  class.  The  var iable  X  is  comprised  of  a  set  of  classes  of  events. 

The  term  set  refers  to  the  six  classes  of  events  associated  with  the  six 
possible  values  that  may  be  assuned  by  X.  The  set  of  classes  of  events 
for  Y  encompasses  the  three  values  that  may  be  assuned  by  Y. 


Insert  Figure  1. 1  about  here 


Simon  (e.g.,  1977,  p.  108)  viewed  a  causal 
an  effect  (Y)  on  to  one  or  more  causes  (Xs) .  In 


relation  as  a  function  of 
this  example,  we  have 
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one  cause,  and  the  functional  relation  takes  the  form  Y  =  _f(X).  It  is 
particularly  important  to  note  that  the  causal  relation  between  X  and 
Y  is  a  relation  determined  over  the  full  range  of  values  on  X  and  Y. 

It  is  not  merely  a  relation  between  the  occurrence  of  two  events  (e.g., 

X  =  1,  Y*l),  or  a  relation  between  the  occurrence  of  two  classes  of 
events  (e.g.,  X  =  6,  6;  Y  =  3,  3).  Let  us  now  address  what  is  meant 
by  self-contained  and  asynsnetric. 

The  function  Y  =  fjX)  is  self-contained  because  one  and  only 
one  value  (class  of  events)  of  Y  is  associated  with  each  value  (class  of 
events)  of  X.  This  implies  that  the  values  of  Y  are  determined 
completely  by  the  values  of  X;  that  is,  given  knowledge  that  X  has 
occurred  and  the  values  of  X,  .we  can  determine  exactly  the  values  of 
Y.  The  function  is  asymmetr ic  because  it  is  impossible  to  reverse  the 
direction  of  causation  [i.e.,  X  =  f_(Y ) ]  and  maintain  unique 
determination.  For  example,  Y  =  1  is  associated  with  X  =  1,  3,  or  5, 
which  means  that  X  cannot  be  uniquely  determined  by  the  knowledge  of  the 
values  of  Y. 

In  more  general  terms,  Simon  considered  that  causal  relations  in 
science  usually  concern  quantitative  functional  relationships,  such  as 
the  amoint  of  wheat  grown  is  a  function  of  the  amount  of  rain  and  the 
amount  of  fertilizer.  In  the  simple  case  where  an  effect  (y)  is 
determined  by  two  causes  (x^  and  Xj),  the  functional  relation  is 
denoted  by  y  =  £  (x^,  x^) ,  where  lower  case  letters  refer  to 
variables  in  deviation  score  form.  This  means  that  the  occurrence  of  the 
effect  y  is  dependent  on  the  occurrence  of  the  causes  x^  and 
in  the  manner  prescribed  by  the  function  £.  The  function  £  may  take 
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many  forms.  For  simplicity's  sake,  the  treatments  in  this  paper  are 
generally  predicated  on  linear  functions.  Thus,  for  example,  the 
functional  relation  y  =  fjx^,  x_2)  expressed  in  linear  form  denotes 


that  ^  is  a  function  of  a  weighted,  additive  combination  of  the  values 

of  x.  and  x_,  namely  y  =  E3  x.  +  EJ  x_.  When 
£  £  ~i2ii  £  jl*2  — 

expressed  as  equations,  functional  relations  are  referred  to  as 

functional  equations. 

A  functional  equation  implies  that  the  system  is  self-contained,  which 
is  to  say  that  the  values  of  the  effect  y  ate  determined  completely  by  the 
function  that  relates  the  values  of  the  causes  x^  and  Xj  to  y.  For 

example,  the  (linear)  functional  equation  y  =  B  x.  +  B  x_ 

£  ~Y*2  — 

represents  a  self-contained  or  closed  system  in  which  values  of  y  are 
completely  determined  by  the  weighted,  additive  combination  of  the  values  of 
x^  and  x2.  The  asymmetry  of  causation  may  at  times  be  illustrated  as 
in  our  example  above,  where  y  is  completely  determined  by  the  function 
relating  y  to  Xj  and  x2,  but  no  function  can  be  found  that,  for 
example,  completely  determines  the  values  of  x^  based  on  a  function  of  y 

and  it,.  However,  the  asymmetry  reflected  in  a  functional  equation,  in  the 

context  of  a  causal  relation,  is  typically  an  imposed  asymmetry  based  on  the 
presumed  order  in  which  variables  occur  naturally.  To  1 1  listrate,  the 
frequency  of  automobile  accidents  may  be  a  function  of  the  amount  of  snowfall, 
but  it  is  unlikely  that  snowfall  is  a  function  of  the  number  of  automobile 
accidents  (Darlington,  1968).  Thus,  the  logic  of  asymmetry  of  causal 
relations  is  that  setting  y  =  f (x)  does  not  imply  that  x  =  £(y) . 

Causal  analysis  is  generally  predicated  on  a  set  of  variables,  where 
multiple  effects  are  regarded  as  functions  of  multiple  causes.  Each  effect  is 
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represented  by  a  functional  equation,  and,  based  on  the  natural  causal 
ordering  among  the  variables,  a  system  of  functional  equations  is  devised  in 
which  a  variable  which  acts  as  an  effect  in  one  equation  is  a  cause  in  an 
equation  later  in  the  causal  order.  For  example,  a  simple  system  may  be 
comprised  by  two  linear  equations,  such  as  y^  =  b^  x  x^,  and  ^ 

=  b  y^,  where  the  natural  causal  ordering  is  Xj7  and 
then  y^.  The  y^  and  y_2  equations  may  be  regarded  as 

self-contained  subsystems  writhin  the  system  of  equations.  Furthermore,  if 

each  of  the  functional  equations  embedded  in  the  system  of  equations  is 

self-contained,  then  the  system  is  self-contained,  which  connotes  that  there 

are  as  many  equations  as  effects  to  be  determined.  As  we  shall  see  later, 

self-containment  of  a  system  of  equations  suggests  that  parameter 

values  (e.g.,  b  )  in  functional  equations  have  a  unique  value  and 
"*1% 

may  be  estimated.  We  shall  also  see  how  these  estimates  are  used  to  infer  the 
strengths  of  causal  relations. 

In  sunmary,  what  Simon's  functional  relation  conception  of  causality 
suggests  is  that  we  infer  the  presence  of  causal  relations  when  we  isolate 
groups  of  variables  into  self-contained  systems  of  functional  equations  in 
which  the  varying  values  of  some  variables  (causes)  appear  to  determine 
totally  the  varying  values  of  other  variables  (effects) .  Within  these 
self-contained  systems,  the  causal  pathways  among  variables  are  determined  by 
the  manner  in  vhich  the  values  of  certain  effects  in  self-contained 


subsystems  (functional  equations)  are  themselves  determined 
by  the  values  of  causes,  which  include  effects  of  other  self-contained 
subsystems  lov*er  in  the  causal  order.  The  inference  of  causation  is  an 
inductive  inference  based  on  presuning  that  the  functional  equations/relations 


describing  the  causal  connections  between  the  values  of  causal  variables  and 
effect  variables  observed  in  the  past  will  continue  to  hold  in  the  future. 

Is  Causality  Replaced  by  Functional  Relation? 

A  nunber  of  empiricist  and  positivist  writers  (e.g.,  Pearson, 

1892/1911;  Schlick,  1949;  Jeffreys,  1948),  have  expressed  the  view,  echoed 
by  some  psychologists  (cf.  Travers,  1981),  that  causality  as  a  concept  has 
been  cast  out  of  science  and  replaced  by  the  concept  of  functional 
relationship  or  by  the  even  less  binding  connection  of  contingency.  How 
does  this  view  contrast  with  Simon's  (1952)  view  of  the  causal  relation  as 
an  asymmetric,  functional  relation? 

lb  begin  with,  Simon's  view  does  not  attempt  to  settle  any 
metaphysical  issues  regarding  the  nature  of  causality  by  asserting  that 
causal  relations  are  to  be  represented  formally  by  functional  relations. 
Simon's  view  is  thus  ecunenical  in  that  it  seeks  to  provide  a  common  basis 
on  viuch  scientists  of  different  metaphysical  pur suasions  may  agree  when 
dealing  with  the  practical  problems  of  demonstrating  causality.  These 
other  empiricists,  however,  have  metaphysical  commitments  when  they  seek  to 
equate  causality  with  nothing  but  functional  relations,  that  is  to  say, 
mathematical  descriptions  and  sunmaries  of  regularities  observed  in 
experience.  For  them  the  real  only  vdiat  is  known  in  experience.  But 
because  the  phenomenal ist  psychology  of  empiricism  holds  that  the  elements 
of  experience  are  logically  unconnected,  this  rules  out  necessary 
connections  in  experience.  Since,  traditionally,  causality  concerns 
necessary  connections,  causality  no  longer  is  a  viable  concept  for 
empiricists.  Causality  must  be  replaced  by  the  weaker  relation  of 
functional  relation,  tdiich  serves  only  tc  describe  the  regularities  of 


succession  of  similar  kinds  of  events  in  experience,  these  regularities 
being  products  of  the  mind's  associative  processes. 

More  important,  however,  is  that  those  positivists  and  empiricsts 
rejecting  causality  in  favor  of  functional  relationships  have  often 
stressed  the  interdependence  of  variables  and  even  the  reversibility  or 
symmetry  of  relationships  (cf.  Bunge,  1959).  Much  of  this  thinking  can  be 
traced  to  the  writings  of  the  physicist  Ernst  Mach,  who  had  a  profound 
influence  on  Rearson  (1892/1911)  and  subsequently  on  the  adherents  of  the 
Vienna  Circle  of  logical  positivists,  which  included  Schlick  (1949)  cited 
above.  According  to  Bunge  (1959,  p.  91)  Mach  "...demanded  the  replacement 
of  every  sort  of  connection,  particularly  causal  connections,  by  functional 
relations  expressing  a  symmetrical  interdependence."  lb  illustrate  this 
view,  Bradley  (1971)  reported,  Mach  would  cite  how  the  pressure  £  of  a 
given  mass  of  gas  is  uniquely  determined  by  its  absolute  temperature  T 
and  volume  V,  according  to  the  equation  pV/T  -  z_  (a  constant) .  But 
then  Mach  would  caution  that  none  of  the  quantities  £,  V,  or  T  should 
be  thought  of  as  a  cause  of  the  other,  but  rather  as  functions  of  each 
other.  According  to  Bradley,  Mach  was  motivated  to  eliminate  the  notion  of 
causality  (interpreted  as  a  necessary  asymmetric  connection)  and  replace  it 
with  the  less  binding  and  more  descriptive  symmetric  connection  of 
functional  interdependence.  Functional  relations  were  viewed  as  describing 
regular  successions  of  phenomena  by  summarizing  the  phenomenal  facts. 

In  contrast,  the  econometrician  and  statistician  Nicholas 
GeorgescuHRoegen  (1971)  pointed  out  how  the  many  examples  of  functional 
interdependence  cited  by  ftech  and  the  positivists  after  him  come  from 
Newtonian  mechanics.  In  Newtonian  mechanics  there  is  no  unique  direction 
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in  which  processes  can  occur.  Events  can  happen  in  just  the  reverse  order 
as  we  observe  them  to  happen  and  still  be  consistent  with  Newtonian 
mechanics.  But  Newtonian  physicists,  Georgescu-Roegen  claimed,  were  faced 
with  a  crisis  when  they  attempted  to  analyze  the  phenomenon  of  heat.  The 
simple  law  that  heat  always  passes  from  hotter  to  colder  bodies  is  an 
asymmetric  relationship  not  derivable  from  Newtonian  physics  (except  by 
invoking  an  empirically  untestable  interpretation  of  probability  in  what 
is  known  as  "statistical  mechanics").  Yet  this  simple  law,  known  as  the 
Second  Law  of  Thermodynamics,  underlies  the  asymmetry  of  countless 
relationships  observed  in  nature  and  makes  meaningful  a  conception  of 
causality  as  an  asymmetric  relationship. 

Simon's  functional  relation  conception  of  causality  is  based  on 
asymmetry  of  relationship.  While  Simon  allowed  that  asymmetry  could  be 
achieved  in  a  definition  of  the  causal  relationship  by  invoking  the  concept 
that  causal  variables  would  measure  events  occurring  before  events  measured 
by  effect  variables,  he  chose  instead  to  define  causal  order  independent  of 
time.  For  him  a  system  of  (linear)  causal  relations  would  be  represented 
by  a  self-contained  system  of  independent  and  consistent  (nonhomogeneous) 
linear  equations.  Causal  ordering  in  such  a  system  would  be  indicated  by 
the  order  in  which  one  would  have  to  solve  for  the  variables  of  the  system. 
We  are  not  yet  ready  to  describe  further  mathematical  details  of  Simon’s 
self-contained  equations  and  causal  order  in  such  a  system.  As  discussed 
later,  the  equations  selected  for  such  a  system  would  be  based  on 
relationships  observed  in  prior  experience  or  those  derived  from  theory. 

The  important  point  here  is  that  Simon  believed  he  had  distilled  what  the 
scientist,  of  whatever  epistemological  persuasion,  is  concerned  with  when 
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he/she  seeks  to  establish  causes,  which  is  to  establish  functional 
relations  among  variables  in  a  self-contained,  closed  system. 

The  impact  of  Simon's  perspective  is  seen  in  econometrics  and  sociology, 
where  methods  of  confirmatory  analysis  using  structural  equations  and  path 
analysis  developed  partly  under  Simon's  influence.  Unfortunately,  Simon's 
views  on  causality  seem  to  have  been  overlooked  by  many  philosophers  of 
science  as  well  as  by  authors  of  popular  textbooks  on  causality  in  the  social 
sciences.  For  example,  no  references  to  Simon  appear  in  Cook  and  Campbell 
(1979),  Heise  (1975),  or  Kenny  (1979),  although  many  treatments  in  these  texts 
reflect  functional  relations.  In  this  paper,  we  will  adopt  Simon's  view  that 
the  causal  relation  is  a  functional  relation  among  sets  of  classes  of  events 
(i.e.,  variables)  within  a  self-contained,  closed  system. 

We  must  make  one  final  observation  concerning  the  metaphysical  status 
of  the  functional  relation  conception  of  causality  we  will  use  in  this 
paper.  We  do  not  claim  to  exhaust  the  idea  of  causality  with  such  a 
concept.  Actually,  we  are  only  interested  in  formulating  conditions  under 
which  causal  relationships  may  become  known,  not  in  stating  vhat  causality 
is.  It  is  the  phenomenal ist  empiricists  who  wish  to  make  the 
metaphysical  claim  that  causality  as  a  functional  relation  merely 
sumnarizes  regularities  of  succession  in  phenomenal  experience,  which  for 
them  is  all  there  is  to  reality.  We  will  make  no  such  claims,  since  they 
are  not  relevant  to  our  task  in  this  paper. 

Functional  Relations  in  Psychology 

Theory  plays  a  major  role  in  the  formulation  of  functional  relations 
in  psychological  research.  The  functional  relation  view  of  causality 
suggests  that  in  order  to  infer  causal  relations  we  must  isolate  groups  of 
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variables  into  self-contained  systems  in  vhich  the  varyng  values  of  effects 
appear  to  be  determined  totally  by  functions  of  values  of  causes.  However, 
because  the  number  of  possible  causal  relations  among  psychological 
variables  is  infinite,  it  is  necessary  to  rely  on  theory  to  identify  the 
variables  to  be  isolated  and  included  in  functional  equations  in  a 
self-contained  system.  Not  all  psychological  theories  furnish  a  meaningful 
basis  for  accomplishing  this  chore,  and  we  will  devote  considerable  space 
in  the  next  section  of  this  paper  to  the  conditions  v#iich  must  be 
reasonably  satisfied  if  a  theory  is  to  be  relied  on  to  develop  functional 
equations  in  a  self-contained  system.  We  preface  this  discussion  with 
interpretations  of  functional  relations  and  self-containment  in  regard  to 
their  use  in  psychology  and  other  disciplines  in  vrfiich  functional  relations 
take  the  form  of  probabilistic  function  equations. 

Functional  relations  were  represented  above  in  the  form  of 

deterministic  functional  equations.  For  example,  the  linear  functional 

equation  y  =  x,  +  B  x_  is  deterministic  because  %  is 
±  “12.2  — 

completely  determined  by  x^  and  However,  in  many  areas  of 

psychological  research,  variation  in  effects  may  occur  for  reasons  other 
than  variation  in  the  causes  included  explicitly  in  a  functional  equation. 
For  example,  variation  may  occur  in  an  effect  because  of  variation  in 
causes  not  included  explicitly  in  an  equation.  These  causes  may  have 
minor,  independent,  and  unstable  influences  on  an  effect,  in  \4iich  case 
they  are  referred  to  as  random  shocks.  The  likelihood  of  the  occurrence 
of  random  shocks  in  psychological  research  suggests  that  it  is  usually 
necessary  to  think  of  functional  relations  and  functional  equations  in 
probabilistic  terms  rather  than  in  deterministic  terms. 
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When  expressed  in  probabilistic  terms,  causal  models  involving  linear 
functional  relations  take  the  form  ^  =  fjx^x^d),  and  a  1  inear 
functional  equation  has  the  form  y  =  B  x.  +  B  x_  +  d. 

"Linear"  is  emphasized  because  we  focus  on  linear  forms  of  probabilistic 
functional  equations  in  this  discussion.  These  forms  differ  from  linear 
deterministic  functional  equations  because  probabilistic  equations  include  a 
disturbance  term,  designated  by  "d".  The  disturbance  term  is  a  surrogate 
that  represents  all  causes  of  y  not  included  explicitly  in  the  (linear) 
probabilistic  functional  equation.  That  is,  x^  and  x2  are  the 
explicitly  included  causes  and  d  represents  all  other  causes  of 
Disturbance  terms  almost  always  include  random  shocks,  which,  as  defined 
above,  are  causes  of  ^  that  are  minor,  independent,  and  unstable. 

Disturbance  terms  may  also  include  two  other  types  of  causes  that  are 
important  at  this  time,  namely  relevant  causes  and  nonrelev ant  causes. 

A  relevant  cause  is  defined  as  a  variable  that: 

(a)  has  a  nonminor,  direct  influence  on  an  effect, 

(b)  is  stable, 

(c)  is  related  to  at  least  one  other  cause  included  explicitly 
in  a  functional  equation,  and 

(d)  makes  a  unique  contribution  to  a  functional  equation,  which 
means  that  it  is  not  linearly  dependent  on  causes  already 
included  in  a  functional  equation  (cf.  James,  1980). 

A  nonrelevant  cause  is  a  variable  that  satisfies  conditions  (a),  (b) ,  and 
(d)  above.  It  is  not,  however,  related  to  one  of  the  causes  included 
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explicitly  in  a  functional  equation. 

The  distinctions  among  random  shocks,  relevant  causes,  and  nonrelevant 
causes  furnish  a  basis  for  ascertaining  whether  or  not  a  probabilistic 
functional  equation  is  self-contained.  We  will  formally  define 
self-containment  for  probabilistic  functional  equations,  and  then  show  how 
this  formal  criterion  can  be  restated  in  terms  of  random  shocks,  relevant 
causes,  and  nonrelevant  causes. 

Like  deterministic  equations,  probabilistic  equations  must  be 
self-contained  before  they  can  be  used  to  represent  functional  (causal) 
relations.  However,  unlike  deterministic  functional  equations,  it  is  not 
possible  to  require  that  the  realized  (measured)  values  of  causes  determine 
the  values  of  an  effect  in  probabilistic  functional  equations,  if  for  no  other 
reason  than  that  the  values  of  effects  will  include  the  influences  of 
unstable,  and  therefore  unmeasureable,  random  shocks.  It  _is  possible 
to  define  self-containment  for  probabilistic  functional  equations  in 
reference  to  the  expected  values  on  an  effect,  conditional  on  the  values  of 
causes  included  explicitly  in  a  functional  equation  [i.e.,  the  (conditional) 
probability  distribution  of  the  effect].  Specifically,  a  probabilistic 
functional  equation  is  self-contained  if  the  realized  values  of  causes 
included  explicitly  in  the  equation  determine  the  (conditional)  probability 
distribution  of  the  effect  variable  (Simon,  1977). 


For  example,  the  linear  probabilistic  functional  equation: 

<1-1’ 

is  self-contained  if  the  conditional  expected  values  of  y_,  given  realized 

values  of  x.  and  x~,  are  provided  by  B.  x,  +  B,  x-. 

_  _  ~z2ii  _  vii  _ 

The  conditional  equation  is  indicated  by: 
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£<X  |  *r*2>  =  5^*1  +  ^2  (1*  2> 

where  E  is  the  expectation  operator. 

Equation  1.2  requires  that  solutions  exist  for  B  and  B  , 

“£<1  1SL2_ 

which  combined  with  the  realized  values  of  x^  and  x2 ,  determine  the 

conditional  expected  values  of  y.  As  shown  in  the  next  section  of  this 

report  (Condition  5),  unbiased  solutions  exist  for  B  and  B 

“i2Li 

only  if  x^  and  x^  are  unrelated  to  d.  This,  in  turn,  connotes  that 
a  probabilistic  functional  equation  will  be  self-contained  if,  and  only  if, 
the  cases  represented  expl icitly  in  an  equation  are  unrelated  to  the 
disturbance  term  of  that  equation.  That  is,  Eq.  1.2  is  dependent  on  the  lack 
of  relationship  between  d  and  both  and  Xg  in  Eq.  1.1.  Thus,  a 
more  fundamental  definition  of  self-containment  of  a  probabilistic  functional 
equation  is  that  explicitly  measured  cases  are  unrelated  to  the  disturbance 
term  of  the  equation. 

We  are  now  ready  to  return  to  the  relationship  between  self-containment 
and  random  shocks,  relevant  causes,  and  nonrelevant  cases.  If  the  disturbance 
term  is  comprised  by  randan  shocks  exclusively,  then  d_  in  Eq.  1.1  cannot  be 
related  to  the  explicitly  measured  cases  x^  and  x2  because,  by 
definition,  random  shocks  are  unrelated  to  all  other  variables.  Similarly,  if 
d  includes  nonrelevant  causes, in  addition  to  random  shocks,  then  again  d 
is  unrelated  to  x.^  and  because,  by  definition,  nonrelevant  causes 
are  unrelated  to  explicitly  included  (i.e.,  measured)  causes.  Thus,  we  may 
conclude  that  (a)  if  the  d  term  in  a  probabilistic  functional  equation 
involves  only  randan  shocks  and  nonrelevant  causes,  then  (b)  it  is 
self-contained  because  explicitly  measured  causes  will  be  unrelated  to  d. 
However,  if  d  contains  at  least  one  relevant  cause,  then  d  will  be 


related  to  at  least  one  explicitly  measured  cause.  That  is,  a  relevant  cause 

of  ^  that  is  not  included  explicitly  in  the  probabilistic  functional 

equation  for  ^  will  be  included  in  the  disturbance  term.  By  definition, 

this  unmeasured  relevant  cause  is  related  to  at  least  one  explicitly 

measured  cause  [see  (c)  in  definition  of  relevant  cause],  from  which  it 

follows  that  either  x^  or  x^r  or  will  be  related  to  d  in  Eq. 

1.1.  Consequently,  Eq.  1.2  is  not  applicable  because  unbiased  solutions  do 

not  exist  for  B  and  E3  ,  and  the  probabilistic  equation  given 
a*!  “£*2 

by  Eq.  1.1  is  not  self-contained.  On  the  other  hand,  if  and  x2 
are  the  only  relevant  causes  of  j/,  then  Eq.  1.1  would  be  self-contained. 

This  suggests  that  the  most  fundamental  definition  of  self-containment  for  a 
linear  probabilistic  functional  equation  is  that  all  relevant  causes  of  an 
effect  are  included  explicitly  in  the  equation. 

Functional  equations  are  assumed  to  be  in  linear  probabilistic  form 
throughout  the  remainder  of  this  paper.  We  will  emphasize,  many  times,  the 
points  that  (a)  a  probabilistic  functional  equation  can  be  used  to  represent  a 
causal  relation  only  in  the  condition  that  the  equation  is  self-contained,  and 
(b)  a  probabilistic  functional  equation  will  be  self-contained  only  in  the 
condition  that  all  relevant  causes  of  the  effect  are  included  explicitly  in 
the  equation.  We  will  also  demonstrate  that,  in  naturalistic  studies,  it  is 
necessary  to  rely  on  theory  to  identify  relevant  causes  that  should  be 
included  in  a  probabilistic  functional  equation,  and  frequently,  to  build  a 
case  that  causes  not  included  explicitly  in  an  equation  are  unrelated  (or 
linearly  dependent  on)  causes  represented  explicitly  in  that  equation. 

Summary 

The  following  ideas  were  developed  in  Section  1.  Causal  relations  are 
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inductive  relations,  and  take  the  form  of  asynmetric  functional  relations 
among  certain  variables,  or  subsets  of  variables,  in  a  self-contained 
structure  (Simon,  1977).  It  is  not  assumed  that  causal  relations  are 
logically  necessary  relations  or  that  causal  relations  take  the  form  of 
logical  implication.  It  is  assumed  that  theory  plays  a  major  role  in  the 
formulation  of  functional  relations  in  psychological  research,  where 
functional  relations  are  typically  represented  by  probabilistic,  rather  than 
deterministic,  equations.  The  probabilistic  equations  must  be  self-contained 
if  they  are  to  represent  causal  relations. 

We  have  adopted  Simon's  functional  relation  perspective  of  causality  in 
this  text.  We  have  also  placed  major  emphasis  on  the  role  played  by  theory  in 
the  formulation  of  functional  relations  in  psychological  research.  As  we 
shall  demonstrate,  theory  is  used  to  isolate  groups  of  variables  into  a  system 
of  functional  equations.  It  is  also  used  (a)  to  identify  relevant  causes  that 
should  be  included  in  a  functional  equation,  (b)  to  build  a  case  that  causes 
not  included  in  a  functional  equation  are  not  relevant  causes,  and  (c)  to 
specify  the  causal  ordering  of  equations  or  subsystems  within  a  system  of 
equations.  Finally,  we  will  illustrate  the  intrinsic  interplay  between  theory 
and  data  in  the  conduct  of  a  confirmatory  analysis. 

2 .  CONDITIONS  FOR  CONFIRMATORY  ANALY5 IS  AND  CAUSAL  INFERENCE 

The  objective  of  this  section  is  to  describe  the  conditions  which  justify 
the  use  of  confirmatory  analysis  to  evaluate  whether  the  causal  hypotheses 
indicated  by  functional  relations  have  scientific  utility.  Ten  conditions  are 
introduced  and  discussed.  The  first  seven  conditions  pertain  to  the 
appropriateness  of  a  theoretical  model  for  confirmatory  analysis  and  causal 
inference.  Reasonable  satisfaction  of  these  conditions  suggests  a 


well-developed,  stable  theoretical  model  in  which  self-contained 
(probabilistic)  functional  equations  specify  hypothesized  causal  relations 
among  causes  and  effects.  These  conditions  are:  (1)  formal  statement  of 
theory  in  terms  of  a  structural  model,  (2)  theoretical  rationale  for  causal 
hypotheses,  (3)  specification  of  causal  order,  (4)  specification  of  causal 
direction,  (5)  self-contained  functional  equations,  (6)  specification  of 
boundaries,  and  (7)  stability  of  the  structural  model. 

The  final  three  conditions  concern  the  operational  aspects  of 
confirmatory  analysis  in  a  population  or  sample.  The  conditions  are:  (8) 
operationalization  of  variables,  (9)  empirical  confirmation  of  predictions  i: 
empirical  support  for  functional  equations;  and  (10)  empirical  confirmation  of 
predictions  II:  fit  between  structural  (i.e.,  theoretical)  model  and 
empirical  data.  The  presentation  of  Conditions  8  through  10  focuses  on 
variables  associated  with  directly  observable  events,  which  are  referred  to  as 
"manifest  variables".  As  noted  earlier,  an  overview  of  the  latent  variable 
form  of  analysis  is  presented  separately  in  Section  4.  Finally,  while 
reasonable  satisfaction  of  the  10  conditions  to  be  described  here  is  required 
for  causal  inference,  we  emphasize  their  role  in  confirmatory  analysis  in  this 
section  and  their  role  in  causal  inference  in  Section  3. 

Conditions  Pertaining  to  Appropr iateness 
of  Theoretical  Models 

Condition  1:  Formal  Statement  of  Theory  in  Terms  of  a  Structural  Model 

Any  attempt  to  explain  the  occurrence  of  natural  events  in  terms  of 
functional  relations  must  begin  with  a  we 11 -dev el oped  theory.  By  the  term 
"theory"  we  mean  a  set  (or  sets)  of  interrelated  causal  hypotheses  which 
attempt  to  explain  the  occurrence  of  phenomena  -  physical,  biological,  social. 
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cultural,  or  psychological  (Singh,  1975).  The  basic  components  of  a  theory, 
or  a  theoretical  model,  are  as  follows  (cf.  EUbin,  1976;  Merton,  1968; 

Singh,  1975): 

(1)  Phenomena,  or  the  variables  that  act  as  causes  and  effects. 

(2)  Causal  connections  among  the  variables.  A  causal  connection 
refers  to  the  hypothesized  causal  association  between  one  cause  and  one  effect 
(e.g.,  x ^  ■+  y^).  The  total  pattern  or  structure  of  causal  connections 

among  ordered  variables,  where  order  reflects  the  natural  sequence  of 
occurrences  of  events  represented  by  variables,  is  the  essence  of  a 
theoretical  model.  It  is  this  structure  of  causal  connections  that  provides 
the  basis  for  the  development  of  functional  relations  and  functional 
equations,  which  relate  each  effect  to  all  of  its  presumed  causes. 
Consequently,  the  structure  of  causal  connections  should  specify  the  specific 
variables  that  are  to  be  related  functionally,  the  causal  ordering  among  the 
variables,  the  direction  of  causation  for  each  causal  connection 
(unidirectional  or  reciprocal) ,  and  the  function  that  relates  each  effect 
variable  to  all  of  its  relevant  causal  variables.  The  last  point  implies  that 
the  structure  of  causal  connections,  or  the  system  of  functional 
equations,  should  be  self-contained. 

(3)  A  theoretical  rationale  for  each  causal  hypothesis  (causal 
connection)  that  describes  the  processes  through  which  a  cause  acts  on 
(operates  on,  produces)  an  effect. 

(4)  Boundaries,  which  specify  the  contexts  (e.g.,  types  of  subjects 
and  situations)  within  vdiich  the  functional  relations/equations  are  expected 
to  hold. 

(5)  Stabil ity,  which  implies  that  the  hypothesized  structure  of  causal 
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connections  will  be  consistent  over  specified  time  intervals.  Inasmuch  as  the 
structure  of  causal  connections  is  represented  by  a  system  of  functional 
equations,  stability  suggests  that  the  functional  equations  will  be  invariant 
over  specified  time  intervals. 

In  its  initial  development,  a  theory  is  based  typically  on  induction, 
wherein  observations  of  events  lead  to  hypotheses  that  certain  variables  are 
related  causally  and  that  certain  processes  or  functions  generate  the 
relations.  In  this  regard,  previous  experience  will  often  suggest  to  the 
theory  builder  the  form  that  functional  relations  might  take.  Additional 
theoretical  work  is  often  needed,  however,  to  develop  the  theory  so  that  it  is 
amenable  to  confirmatory  analysis.  This  development  includes  consideration  of 
the  form  of  the  theory  with  respect  to  (a)  its  breadth  or  elaborateness;  (b) 
its  depth,  which  is  reflected  by  the  complexity  of  variables  and  the  molarity 
of  causal  hypotheses;  and  (c)  the  structure  of  causal  connections,  which  is 
typically  stated  in  the  form  of  a  structural  model.  Each  of  these 
considerations  is  discussed  below.  We  will  then  proceed  to  discussions  of  the 
other  aspects  of  a  theory. 

Form  of  a  Theory  -  Elaborateness 

A  theory  may  be  very  simple  and  involve  just  one  effect  and  the  cause (s) 
that  is  (are)  presumed  to  be  functionally  related  to  that  effect.  For 
example,  one  might  hypothesize  that  increases  in  role  expectations, 
communicated  by  management  to  subordinates  regarding  q^iality  and  quantity  of 
subordinates'  performance,  may  cause  subordinates  to  perceive  that  they  have 
too  much  work  to  do  (i.e.,  are  overloaded — cf.  James  &  Sells,  1981).  This  is 
a  "nonelaborate  theory",  consisting  of  just  one  effect  (role  overload)  and  two 
causes  (expected  quality  of  role  performance  and  expected  quantity  of  role 
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performance).  Nonelaborate  theories  are  acceptable  from  the  perspective  of 
confirmatory  analysis  if  the  model  is  self-contained,  which  is  to  say  that  all 
relevant  causes  of  role  overload  are  contained  in  the  theoretical  model  (i.e., 
the  model  is  self-contained) . 

A  theory  may  be  made  more  elaborate  by  adding  effects,  and  as  necessary, 
other  relevant  causes  of  those  effects.  For  example,  perceptions  of  role 
overload  may  be  thought  to  cause  state  anxiety,  defined  as  "subjective, 
consciously  perceived  feelings  of  tension,  apprehension,  [and]  nervousness, 
accompanied  by  or  associated  with  activation  of  the  autonomic  nervous  system 
(ANS)"  (Spielberger ,  1977,  p.  176).  The  guiding  rule  is  that  one  should 
include  all  relevant  causes  of  each  effect  in  the  theoretical  model.  In  this 
example,  if  no  other  causal  variables  are  added  to  the  model,  then  the  theory 
builder  is  assuning  that  role  overload  is  the  only  relevant  cause  of  state 
anxiety. 

Finally,  one  can  develop  highly  elaborate  theories.  Tb  pursue  the 
present  illustration,  role  expectations,  role  overload,  and  anxiety  might  be 
viewed  as  a  subsystem  of  a  more  general  system,  and  one  might  propose  both 
direct  and  indirect  causes  of  expected  quantity  and  quality  of  work  (e.g., 
increases  in  demand  for  product) ,  as  well  as  direct  and  indirect  influences  of 
anxiety  on  other  psychological  phenomena,  such  as  performance  and  withdrawal 
behaviors.  The  rule  remains;  each  time  the  theory  builder  enters  an  effect 
into  the  theoretical  model,  he  or  she  must  also  enter  all  relevant  causes  of 
that  effect. 

In  later  discussion  it  will  be  argued  that  the  more  elaborate  the  theory, 
the  more  "faith"  one  may  have  in  the  results  of  a  confirmatory  analysis. 
Nevertheless,  confirmatory  analysis  may  be  applied  to  nonelaborate  or 
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"moderately"  elaborate  theories  insofar  as  the  theories  are  self-contained. 

It  must  be  recognized,  however,  that  nonelaborate  and  moderately  elaborate 
theories  are  typically  subsystems  chosen  from  more  global  theoretical 
subsystems  or  systems.  While  acceptable,  such  selection  obviously  limits  the 
explanatory  value  of  the  chosen  theory  because  the  functioning  of  a  specific 
subsystem  depends  on  the  functioning  of  more  general  sybsystems  and  systems. 
For  example,  expected  quality  and  quantity  of  role  performance  may  be  the 
only  relevant  causes  of  role  overload,  but  this  self-contained  model  will  be 
influenced  by  the  causes  of  expected  quality  and  quantity  of  role  performance. 

Form  of  a  Theory  -  Molar  Causal  Connections  and  Mediating  Meehan ians 
In  addition  to  elaborateness  or  breadth  of  a  theory,  it  is  also 
possible  to  view  a  theory  in  terms  of  its  depth  by  assessing  the 
"complexity"  of  variables  and  "molarity"  of  causal  connections  included  in 
functional  relations.  Tb  contrast  extremes,  molar  "refers  to  causal  laws 
stated  in  terms  of  large  and  often  complex  objects",  vhereas  micro,  or 
micromed iation ,  “refers  to  the  specification  of  causal  connections  at  a 
level  of  smaller  particles  that  make  up  the  molar  objects  on  a  finer  time 
scale"  (Cook  &  Campbell,  1979,  p.  32).  Tb  illustate,  a  causal  hypothesis 
that  perceived  role  overload  will  result  in  state  anxiety  implies  a  molar 
causal  connection  between  two  complex  variables.  A  more  specific 
micromed iational  model,  or  "micromed iational  chain",  is  presented  in  later 
discussions  of  theoretical  rationale  for  causal  hypotheses  (Condition  2), 
vrtjere  it  is  postulated  that  perceived  role  overload  produces  state  anxiety 
by  a  model  such  as:  role  overload  -*  anticipated  failure-*-  anticipated 
negative  consequences  of  failure  (e.g.,  reprimand)  -*-  state  anxiety. 
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As  discussed  by  Cook  and  Campbell  (1979),  it  is  not  only  acceptable 
but  often  necessary  to  employ  theoretical  models  involving  molar  causal 
connections  among  complex  variables.  It  follows  that  the  more 
micromedia ting  variables  which  serve  to  connect  complex  variables  will  not 
be  included  explicitly  as  separate  variables  in  the  theoretical  model, 
and  therefore,  functional  relations  and  equations.  It  is  possible, 
however,  to  attempt  to  identify  at  least  the  more  important  of  these 
mediating  variables,  and  to  propose  how  these  variables  enter  into  the 
causal  processes.  That  is,  one  uses  mediating  variables  to  attempt  to 
explain  how  one  complex  variable  produces  or  acts  on  another 
complex  variable.  Thus,  as  in  the  example  above,  role  overload  and  state 
anxiety  may  be  the  only  variables  included  in  a  theoretical  model.  But 
reference  may  be  made  to  mediating  processes,  such  as  anticipated  failure 
and  negative  consequences  resulting  from  failure,  that  lead  to  associations 
between  role  overload  and  state  anxiety.  We  shall  use  the  term  mediating 
mechanism  to  refer  to  an  intervening  or  mediating  variable  that  (a)  is 
not  included  explicitly  in  a  theoretical  model,  but  (b)  is  used  to  help  to 
explain  the  processes  by  which  a  complex  cause  produces  a  complex  effect  in 
a  molar  causal  connection. 

Failure  of  a  mediating  mechanism  to  operate  as  expected  for  some,  but 
not  all  or  even  most,  subjects  may  be  regarded  as  a  relatively  randan  event 
and  therefore  one  form  of  random  shock.  Such  failures  render  molar  causal 
connections,  and  therefore  functional  relations  and  equations,  both 
fallible  and  probabilistic  (Cook  &  Campbell,  1979).  That  is,  a  cause  may 
not  produce  an  effect  because  a  mediating  mechanism  on  which  the  causal 
connection  depends  is  not  functioning  in  the  presorted  manner  for  some 
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subjects.  Thus,  we  might  say  that  a  cause  will  produce  an  effect  dependent 
on  the  occurrence  of  the  presumed  mediating  mechanisms  for  all  subjects, 
which  is  a  probabilistic  statement  and  one  of  the  reasons  why  functional 
equations  are  represented  in  the  form  of  probabilistic  equations.  Finally, 
the  probabilistic  nature  of  a  causal  hypothesis  is  associated  logically 
with  the  molarity  of  the  causal  connection.  Cook  and  Campbell  (1979,  p. 

33)  noted  that  "...it  is  probably  the  case  that  the  more  molar  the  causal 
assertion  and  the  longer  and  more  unspecified  the  assumed  micromediational 
chain,  the  more  fallible  the  causal  law  and  the  more  probabilistic  its 
supporting  evidence." 

lb  avoid  potential  confusion,  it  is  helpful  to  distinguish  between  the 
role  of  mediating  mechanisms  and  relevant  causes  in  functional  equations. 
Consider  the  following  functional  equation  that  relates  tv*>  complex 
variables:  y  «  B  x  +  d...  If  w  is  a  relevant  cause  of  y, 

i.  - 

then  w  must  be  entered  into  the  equation;  that  is,  y  =  B^x  +  + 

d^.  Note  that  (a)  x  is  still  presumed  to  have  a  direct  effect  on 

(b)  the  equation  is  not  self-contained  without  explicit  inclusion  of 
w,  and  (c)  w  may  or  may  not  be  a  complex  variable.  However,  if  w  is 
a  mediating  mechanism  for  the  function  relating  y  to  x,  then  the  model 
has  the  form  x  -*■  w This  model  indicates  two  functional 
equations,  namely  w  =  Bx  +  d  and  ^  =  B  w  +  d  . 

y  Vr"  Jr 

This  suggests  that  explicit  inclusion  of  w  renders  the  (x,^) 
relationship  indirect,  where  the  effect  of  £  on  ^  must  now  pass 
through  w.  Furthermore,  x  is  no  longer  a  relevant  cause  of  ^ 
because,  by  definition,  relevant  causes  must  be  directly  related  to 
effects.  On  the  other  hand,  if  w  is  unmeasured,  then  relative  to  the 
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molarity  of  the  theory,  x  is  a  direct  cause  of  y  and  the  molar 
connection  may  be  regarded  as  self-contained. 

In  summary,  many  theories  in  psychology  are  nonelaborate  or 
moderately  elaborate,  involving  molar  causal  connections  linking  a  few 
complex  variables.  Such  theories  may  be  quite  useful  for  testing  selected, 
self-contained  causal  subsystems.  However,  more  elaborate  theories 
involving  more  holistic  subsystems  or  systems,  as  well  as  explication  and 
inclusion  of  mediating  mechanisms  in  theoretical  models,  are  clearly  the 
long-term  objectives.  Nevertheless,  regardless  of  the  scope  of  a 
particular  theoretical  model,  if  it  is  self-contained  then  it  is  possible 
to  proceed  to  the  next  step  in  confirmatory  analysis,  which  is  the 
development  of  a  structural  model. 

Development  of  Structural  Models 

It  must  be  possible  to  propose  a  theory  in  quantitative  terms  if  it  is 
to  be  subjected  bo  confirmatory  analysis.  The  process  of  confirmatory 
analysis  typically  begins  by  specifying  the  presumed  structure  of  causal 
connections  among  the  variables  in  the  form  of  a  graphic  model. 

Consider  the  graphic  model  of  the  oversimplified  theory  regarding 
causes  and  effects  of  role  overload  in  Figure  2.1.  (All  variables  are 
regarded  as  complex  and  .thus  all  causal  connections  are  molar.)  The  model 
predicts  that  (a)  expected  quality  (x^)  and  expected  quantity  (x2) 
of  role  performance  are  causes  of  perceived  role  overload  ) ;  (b) 

role  overload  (^)  is  a  cause  of  state  anxiety  (y2) *  and  (c) 
expected  quality  (x^)  and  quantity  (x^  of  role  performance  do  not 
cause  state  anxiety  <^2)  directly,  rather  they  affect  it  indirectly 
through  their  effects  on  the  intervening  role  overload  perceptions 


(y^).  Representations  of  theory  or  theoretical  models  are  referred  to 
as  causal,  or  structural ,  models;  the  term  "structural"  is  used  to  denote 
that  the  structure  of  the  causal  connections  has  been  specified. 

Insert  Figure  2. 1  about  here 


Definition  of  terms.  In  regard  to  the  structural  model ,  y^  and 
y2  are  referred  to  as  endogenous  variables;  an  endogenous  variable 
is  a  dependent  variable  (i.e.,  effect)  whose  occurrence  is  to  be  explained 
by  the  structural  model,  x^  and  are  exogenous  variables. 

An  exogenous  variable  is  a  predetermined  variable  that  acts  as  a  cause  but 
whose  occurrence  is  not  to  be  explained  by  the  model. 

The  curved  (double-headed)  arrow  between  x^  and  x2  means  that 
although  the  exogenous  variables  may  be  related,  their  relationship  is  not 
to  be  explained  by  this  particular  structural  model.  The  straight 
(single-headed)  arrows  in  the  model  represent  the  hypothesized  causal 
connections.  The  lack  of  straight  arrows  between  the  two  exogenous 
variables  (x^  and  x2)  and  the  endogenous  y2  variable  reflects 
the  hypotheses  that  the  exogenous  variables  do  not  have  direct  causal 
connections  with  y2> 

Associated  wnth  each  straight  arrow  is  a  structural  parameter 

(i.e.,  B,B,B).  The  structural  parameters  assume 
i-1-1  ~Xl*2  ~X2*-1 

values  that  reflect  the  strengths  of  the  causal  relationships. 
Specifically,  each  structural  parameter  reflects  the  amount  of  change 
in  an  effect  (endogenous  variable)  that  results  from  a  unit  of  change 
in  a  cause  (exogenous  variable  or  preceding  endogenous  variable),  with 


is  a 


all  other  causes  of  that  effect  held  constant.  For  example,  B 

*-1-1 

shorthand  notation  for  B  ,  which  indicates  that  x„,  also  a  cause 

of  is  held  constant  when  the  effect  of  x^  on  is  ascertained. 

Thus,  B  is  the  unique  effect  of  x,  on  y..  A  key 

“*1-1  i  ± 

objective  of  confirmatory  analysis  is  to  estimate  the  values  of  the 

structural  parameters;  this  process  is  addressed  in  Condition  9. 

The  small  "d's"  associated  with  each  of  the  two  endogenous  variables 

are  the  disturbance  terms.  Disturbance  terms  involve  variation  in  an 

endogenous  variable  that  is  not  to  be  attributed  to  the  causes  of  that 

variable  included  explicitly  in  a  structural  model.  For  example,  d 

~^1 

accounts  for  all  variation  in  that  cannot  be  attributed  to  x^ 

and  x_.  The  straight  arrow  from  d  to  y,  suggests  that 
-  "*1  ~ 
these  other  sources  of  variation  are  also  causes  of  In  the 

structural  model,  the  sources  of  variation  in  d  ,  as  well  as 

d  ,  include  random  shocks  and/or  unmeasured  or  omitted  causes,  which 

may  or  may  not  be  relevant  causes.  In  an  actual  confirmatory  analysis  with 

realized  values  on  variables,  a  disturbance  term  may  also  include  (a) 

random  measurement  errors  in  the  effect  and  the  causes,  where  the  primary 

concern  is  error  in  one  or  more  of  the  causes;  and  (b)  nonrandom 

measurement  errors,  such  as  bias  in  the  scales  of  measurement  and  method 

variance.  Random  shocks  and  unmeasured  causes  are  discussed  in  greater 

detail  in  Condition  5;  random  and  nonrandom  measurement  errors  are 

addressed  in  Condition  8. 

Functional  equations.  The  formal  structural  model  in  Figure  2.1 


specifies  the  form  of  the  functional  equations  that  are  to  be  used  to 
represent  the  functional  relations.  There  are  two  endogenous  variables  in 


Figure  2.1,  and  thus  two  functional  relations,  namely  ^  =  _f  (x^, 
x2,  d^)  and  ^  ~  d2).  Assuming  linearity  and 

additivity,  the  functions  may  be  viewed  as  taking  the  form  of  weighted, 
additive  combinations.  Specifically,  the  functional  equations  are,  in 
deviation  form: 

(2''> 

+  %  12-21 
To  sunmarTze,  the  structural  model  indicates  the  functional 

relations  that  relate  effects  to  causes  and  specifies  the  form(s)  of  the 

functional  equations  that  are  to  be  used  to  represent  functional  relations. 

However,  specifying  functional  relations  and  equations  does  not 

necessarily  mean  that  the  processes  by  which  causes  produce  effects  have 

been  explained.  For  this  we  shall  often  need  a  theoretical  rationale. 

Condition  _2:  Theoretical  Rationale  for  Causal  Hypotheses 

Simon  (1977)  suggested  that  some  functional  relations  are 

self-explanatory  in  the  sense  that  the  processes  by  vhich  a  cause  produces 

an  effect  are  self-evident.  Tb  illustrate,  the  processes  by  which 

fertilizer  and  rain  act  to  influence  the  growth  of  vheat  need  not  be 

explained  by  additional  theory  or  mediating  mechanisms  because  wheat  growth 

is  generally  accepted  to  be  a  function  of  rain  and  fertilizer  (although 

type  of  fertilizer  is  a  relevant  question).  Darlington  (1968)  made  a 

similar  point  regarding  snowfall  and  increases  in  automobile  accidents. 

However,  many  relations  among  variables  are  more  subjective,  and  what  may 

appear  to  be  a  functional  relation  is  instead  merely  a  covariation  among 

variables  that  are  not  causally  related.  A  classic  example  used  by 

philosophers  to  indicate  covariation  without  causation  is  the  correlation 
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between  the  occurrence  of  night  and  the  occurrence  of  day.  This 
covariation  is  spurious;  covariation  between  night  and  day  is  due  to  common 
causes  (e.g.,  the  earth's  rotation  about  its  axis  and  the  sun).  This 
example  highlights  the  point  that  an  attempt  to  specify  the  processes  by 
vihich  night  causes  day,  or  vice  versa,  would  likely  preclude  an  erroneous 
causal  hypothesis  that  night  is  a  function  of  day. 

An  attempt  must  be  made  to  separate  nonobvious  functional  relations 
from  simple  covariation  by  proposing  a  theoretical  rationale  for  the 
functional  relations.  Prior  experience  and  observation  are  helpful  in 
identifyng  covariation  among  variables,  where  the  covariation  may  reflect  a 
functional  relation.  The  theoretical  rationale  is  typically  obtained  by 
development  of  a  theory  from  careful  observations,  or  by  deducing  from 
an  existing  theory  a  proposal  of  how  causes  produce  effects;  that  is,  an 
explanation  of  why  variables  covary.  It  is  also  typical  that  the 
theoretical  rationale  involves  the  introduction  of  mediating  mechanisms  to 
help  to  explain  molar  causal  connections  among  complex  variables. 

lb  illustrate  these  points,  a  theoretical  rationale  is  developed 
for  both  the  causes  of  role  overload  and  the  role  overload  ->■  state  anxiety 
causal  connection  in  Figure  2.1.  Beginning  with  role  overload,  prior 
experience  in  work  environments  may  suggest  that  management's  decision  to 
increase  expectations  for  quality  (x^)  and  quantity  (x2)  of  role 
performance  results  in  perceptions  of  overload  (j^)  on  the  part  of 
subordinates.  While  this  observation  implies  a  functional  relation  between 
^  and  x^  and  x2,  does  not  necessarily  specify  how  it  is  that 
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quality/quantity  expectations  produce  role  overload.  A  theory  of  "stress" 
might  serve  this  purpose  (cf.  Katz  &  Kahn,  1978).  For  example,  it  might  be 
postulated  that  increased  expectations  for  higher  quality  output  in  shorter 
time  spans  results  in  perceived  overload  because  these  expectations  exceed 
both  physical  and  personnel  resources  (e.g.,  demands  for  quality  exceed  the 
tolerances  of  machinery  and  the  technological  training  of  personnel)  as  well 
as  time  resources  (e.g.,  the  quantity  of  expected  output  cannot  be  achieved  in 
the  existing  wark^day  or  wark-week) .  In  short,  perceived  overload  occurs  when 
role  prescriptions  and  demands  exceed  available  resources  and  time.1 

The  explanation  above  assumes  the  presence  and  operation  of  a  number  of 
mediating  mechanisms.  For  example,  the  presumed  causal  connection  between 
increased  demands  for  quality  and  perceived  overload  is  a  molar  causal 
connection  between  two  complex  variables.  This  relationship  depends  on  such 
things  as  accurate  communication  of  expectations  to  subordinates  from 
management,  cognitive  comparisons  of  resources  and  demands  on  the  part  of 
subordinates,  and  assignment  of  similar  meaning  to  the  environment  by  all  or 
most  subordinates  (i.e.,  all  or  most  subordinates  perceive  similar  levels  of 
role  overload).  The  molar  causal  connection  may  in  fact  be  true,  but  can  only 
be  stated  as  a  probability  because  the  mediating  mechanisms  might  not  function 
in  the  presumed  manner  for  all  subordinates  (e.g.,  a  partially  brain-damaged 
subordinate  may  not  be  able  to  make  the  presumed  cognitive  comparisons) . 

A  theory  of  stress  may  also  be  used  to  derive  a  theoretical  rationale  for 
the  molar  causal  connection  between  ^  (state  anxiety)  and  (role 
overload).  Subordinates'  perceptions  that  their  role  performance  is  adversely 
affected  by  inadequate  resources  should  signify  to  them  that  they  are  not 
likely  to  succeed.  This  perceived  or  anticipated  likelihood  of  failure 
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should  arouse  conscious  affective  states  of  tension  and  apprehension 
because  failure  should  result  in  negative  consequences,  such  as  reprimands, 
failure  to  be  promoted,  or  dismissals.  Thus,  the  causal  connection^  -*■ 

^2  is  dependent  on  a  theoretical  rationale  involving  several  mediating  mechanisms, 
namely  ^  (role  overload)  -*•  anticipated  likelihood  of  failure  ->■ 
anticipated  likelihood  of  negative  consequences  (state  anxiety)  . 

Here  again  mediating  mechanisms  render  the  functional  relation 
probabilistic  if  any  one  of  the  mechanisms  does  not  operate  as  expected  for 
all  subjects.  Tb  illustrate,  while  role  overload  may  be  perceived,  such 
perceptions  might  not  lead  to  feelings  of  tension  and  apprehension  if 
subordinates  do  not  anticipate  an  association  between  overload  and  failure. 
Consequently,  the  occurrence  of  role  overload  might  not  lead  invariably  to 
anxiety  for  all  subordinates  and  the  functional  relation  is  probabil istic. 

In  conclusion,  many  functional  relations  in  psychology  are  nonobvious  and 
involve  molar  causal  connections  among  complex  variables.  A  theoretical 
rationale  involving  mediating  mechanisms  is  therefore  required  to  propose  how 
causes  produce  effects.  An  important  contribution  of  a  theoretical  rationale 
is  that  it  assists  in  differentiating  functional  relations  from  simple 
covariation.  Another  contribution  is  the  identification  of  at  least  some  of 
the  mediating  mechanisms  on  which  functional  relations  depend.  In  the 
situation  where  a  functional  relation  receives  only  weak  probabilistic  support 
in  a  confirmatory  analysis,  or  is  disconfirmed  by  that  analysis,  the  presumed 
operation  of  mediating  mechanisms  is  a  key  source  for  reevaluation. 

Condition  Specification  of  Causal  Order 

The  causal  order  is  typically  thought  of  as  a  temporal  order  or  sequence 
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in  which  causes  occur  before  effects;  a  causal  order  must  be  specified  for 
each  causal  relation.  It  is  also  typical  to  assume  that  a  time  interval,  or 
causal  interval ,  intervenes  between  the  occurrence  of  a  cause  and  the 
occurrence  of  an  effect.  Tins  assumption  may  prove  to  be  a  problem,  however, 
because  "in  many  instances  where  the  scientist  speaks  of  cause  (e.g.,  'force 
causes  acceleration'),  no  time  sequence  is  involved"  (Simon,  1977,  p.  82). 
Simon  (1977)  "avoided"  the  time  sequence  and  causal  interval  issue  by  arguing 
that  the  causal  order  anong  a  set  of  variables  is  given  by  the  order  in  which 
variables  occur  naturally  in  an  asymmetric,  self-contained  system  of 
functional  relations,  and,  therefore,  the  order  in  which  one  would  solve  for 
the  values  of  the  variables  in  a  system  of  ordered  functional  equations. 

Figure  2.1  and  functional  equations  2.1  and  2.2  illustrate  a  presumed 
causal  order;  the  ordering  of  equations  2.1  and  2.2  denotes  that  x^  and 
x2  occur  before  and  that  ^  occurs  before  Causal  intervals  are 
not  specified  for  the  causal  connections.  This  does  not  imply  that  causal 
intervals  are  unimportant  (in  our  opinion) ,  and  we  will  discuss  causal 
intervals  both  below  (briefly)  and  in  a  number  of  the  remaining  conditions. 
Nevertheless,  the  point  here  is  that  the  causal  ordering  among  variables, 
with  the  exception  of  purely  exogenous  variables  (i.e.,  and  x^) , 
is  given  by  the  presumed  ordering  of  functional  relations  and,  therefore, 
the  ordering  among  the  functional  equations.  This  rationale  is  especially 
pertinent  to  cross-sectional  designs  where  no  time  intervals  are  involved 
empirically  (i.e.,  all  data  are  collected  at  the  same  time).  On  the  other 
hand,  this  reasoning  means  that  one  is  relying  on  theory  to  establish  a 
causal  ordering  among  variables  inasnuch  as  the  functional  equations  are 
themselves  given  by  theory.  This  may  also  prove  to  be  problemsome,  as 
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discussed  below. 

The  important  question  with  respect  to  causal  order  is  whether  the 
presumed  causal  ordering  is  correct.  Consider,  for  example,  that  causal  crder 
is  easily  established  for  many  naturally  occurring  events,  such  as  the  use  of 
heroin  precedes  withdrawal  symptoms.  On  the  other  hand,  causal  ordering  among 
other  variables  is  not  generally  obvious  and  is  subject  to  misspecification. 

TP  illustrate,  Zajonc  (1980)  challenged  the  widely  accepted  model  that 
perceptions  (P)  of  situations  (S)  precede  affective  reactions  (A)  to 
situations  (i.e.,  S P  -*■  A) .  Focusing  on  the  cognitive  processing  of 
"hot  cognitions"  (e.g.,  perceptions  of  danger) ,  Zajonc  proposed  and  supported 
a  model  of  the  form  S  -*■  A  -*■  P,  which  suggests  that  differentiated 
cognitive  interpretations  ( P )  are  a  result  of  an  attempt  to  explain  the 
affect  (A)  which  occurred  almost  simultaneously  with  the  presentation  of 
S.  If  generalized  to  the  structural  model  in  Figure  2.1,  one  might  argue 
that  anxiety  (^)  occurs  prior  to  perceptions  of  role  overload  (^)  . 

This  illustration  serves  to  point  out  that  (a)  causal  ordering  among 
variables,  especially  subjective  constructs  such  as  perceptions  and 
affect,  is  often  not  obvious;  (b)  one  must  rely  frequently  on  theory  to 
propose  a  causal  order;  and  (c)  the  theory  may  be  wrong.  Now,  it  is  true 
that  a  causal  order  must  be  specified  before  a  confirmatory  analysis  is 
conducted,  and  that  one  should  not  explore  different  causal  orders  with 
the  same  set  of  data  in  order  to  optimize  goodness  of  fit  between  a  model 
and  data  (Duncan,  1970,  1975).  But  it  is  also  true  that  one  may  test  for 
alternative  causal  orders  with  the  same  set  of  data  (cf.  Billings  & 

Wfoten,  1978).  This  is  possible  only  if  structural  models  specifying 
different  orders  are  proposed  prior  to  the  analysis  of  data.  In  other 
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words,  if  different  theories  suggest  different  causal  orders,  and  if  the 
theories  furnish  structural  models  with  conflicting  (and  empirically 
testable)  predictions,  then  it  is  possible  to  determine  vdiich  of  the 
models,  if  any,  has  the  best  fit  with  the  data  (cf.  Griffin,  1977). 

As  a  final  point,  it  is  often  thought  that  the  establishment  of  a 
causal  order  is  facilitated  by  the  use  of  a  time  series  form  of  design, 
where  measurement  of  presumed  causes  precedes  measurement  of  presumed 
effects  by  a  discrete,  identifiable  time  period  (cf.  Ostrom,  1978)  . 
However,  measuring  presumed  causes  before  presumed  effects  in  no  way 
implies  that  the  true  causal  order  is  consistent  with  order  of  measurement 
(cf.  Rozelle  &  Campbell,  1969) .  Moreover,  the  times  of  measurement 
(measurement  intervals)  must  correspond  closely  to  the  true  causal 
intervals  in  a  time  series  design  (cf.  Kenny,  1979)  ,  which  raises  an 
obvious  problem  for  psychology  inasmuch  as  causal  intervals  are  often 
unknown.  Time  series  designs  are  discussed  in  the  next  condition  and  in 
later  conditions.  The  key  points  here  are  thac  causal  ordering  and  time 
of  measurement  in  a  time  series  analysis  should  be  dictated  by  prior 
knowledge  of  causal  intervals  and  a  structural  model;  time  of  measurement 
should  never  dictate  causal  order,  causal  intervals,  or  the  structural 
model . 

Condition  4^  Specification  of  Causal  Direction 

Up  to  now  we  have  focused  on  unidirectional,  or  recursive  relations 
of  the  form  x  This  emphasis  was  purposeful  and  designed  to 

maintain  consistency  with  Simon's  (1977)  view  of  causality  as  an 
asymmetric  functional  relation.  However,  studies  of  causality  in  science 
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also  allow  for  nonrecursive  relations  which  take  the  form  ^  Yj . 

This  form  implies  reciprocal  causation  between  ^  and  y. , 
where  the  variables  mutually  affect  one  another.  The  criterion  for 
Cordition  4  is  that  all  direct  causal  connections  within  a  structural 
model  must  be  specified  as  either  recursive  or  nonrecursive. 

Nonrecursive  Models 

A  nonrecursive  model  involving  reciprocal  causation  is  presented  and 
discussed  below.  The  discussion  includes  an  interpretation  of  causality 
which,  following  an  amendment  to  Simon' s  views,  allows  for  reciprocal 
causation.  A  distinction  is  then  made  between  nonrecursive  and  cyclical 
recursive  models. 

A  nonrecursive  structural  model  based  on  a  study  by  Kritzer  (1977)  and 

used  by  James  and  Singh  (1978)  to  illustrate  reciprocal  causation  is  presented 

in  Figure  2.2.  As  with  recursive  models,  the  designate  exogenous 

events,  the  ^  designate  endogenous  events,  the  B's  reflect  structural 

parameters,  and  the  d  designate  disturbance  terms.  Functional 

'•**1 

equations  for  nonrecursive  models  are  developed  in  the  same  manner  as  those 
for  recursive  models;  an  equation  is  constructed  for  each  effect,  and  includes 
all  causes  having  a  direct  causal  connection  with  that  effect.  The  functional 
equations  for  Figure  2.2  are  (assuning  linearity  and  deviation  scores) : 


y,  =  B  y„  +  B  X,  +  B  x„  +  d 

h  -^1*2*2  -Zl\-I  -^1*2-2  -*i 


(2.3) 


y_  =  B  y,  +  B  X,  +  B  .  x .  +  d 

-  “3^1  I  “^3  -  ^2 

By  convention,  endogenous  variables  involved  in  a  reciprocal  relation 
are  included  in  a  functional  equation  before  the  exogenous  variables 
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(e.g ^  is  placed  before  x^  and  in  EJq.  2.3). 

«  ■;  r  *,  ■  ».  ■?>:',T»:r^A’Trv».  v  r  t«  ■  -  ■  ■  rr  -■>”-<  rrrrrrr  v  — ■  .  ■»  ■  «  ■.  ■.  «.  v  ■  ■■»  ■:  .  ■■  ■ 

Insert  Figure  2.2  about  here 

»•  k1  »A  *,  k  ",'Trr*:^  v  '.■rrr''",  *.  ■■  rv  ■’  ■. «,  ■'■rr*.1  ■■*. r  v »  ■  r  vt*  ■  ■- «.  ■?  ■■.■.■■;■■■  »  ■  ■  ■■  *» 1  ■- 

The  structural  model  and  functional  equations  are  based  on  the  following 
causal  hypotheses: 

1.  The  amount  of  violence  employed  by  police  at  a  political  protest 
event  (y  w  the  extent  to  which  police  use  increasingly  more 
dangerous  controls)  is  a  function  of  (a)  the  degree  to  which  police  are 
prepared  for  or  anticipate  violence,  as  indicated  by  the  presence  of  riot 
equipment  (x.) ,  and  (b)  verbal  provocation  by  protesters, 
operationalized  by  the  extent  to  which  obscenities  are  collectively 
shouted  by  demonstrators  (x2) •  Xq  and  x2  are  exogenous  causes 

of^,  and  precede^  in  the  causal  order. 

2.  The  amount  of  violence  evidenced  by  protesters  collectively 

(_^2  operationalized  on  a  scale  ranging  from  protestors'  use  of 
anti^police  slogans  to  protestors'  use  of  weapons)  is  a  function  of  (a) 
the  conscious  attempt  by  protesters  to  effect  normative  controls  on 
violence  (x^) ,  and  (b)  the  number  of  (perceived)  arbitrary  arrests  of 
protesters  made  by  police  (x^).  £3  and  X4  are  exogenous 

causes  of  y2  ar*d  precede  the  causal  order. 

3.  Police  violence  (^)  and  protester  violence  (^2)  are 
reciprocal  causes  of  each  other.  A  causal  order  is  not  especially 
relevant  here,  either  police  or  protesters  may  initiate  violence.  The 
point  is  that  an  initiation  or  escalation  of  violence  on  the  part  of 
police  (protesters)  is  followed  by  an  initiation  or  escalation  of  violence 
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by  the  other  party.  Elirthermore,  an  escalation  by  one  party  influences 
escalation  by  the  other,  resulting  in  a  dynamic  system  of  mutual 
causation.  It  is  presumed,  however,  that  the  mutual  escalation  of 
violence  will  level  off,  or  reach  an  "equil ibriut^type  condition"  (cf. 
Namboordiri  et  al.,  1975)  ,  which  in  part  is  determined  by  the  values  on 
the  exogenous  variables  (equilibrium- type  conditions  are  discussed  in 
Condition  7) .  For  exanple,  the  mutual  effects  between  police  and 
protester  violence  is  less  likely  to  reach  the  stage  of  joint  use  of 
weapons  if  protesters  exercise  normative  control  and  police  do  not  have 
riot  equipment  available.  Finally,  and  of  major  importance,  it  is  assumed 
that  the  reciprocal  effects  between  police  and  protester  violence  are 
essentially  instantaneous,  or  at  least  so  rapid  that  reliable  causal 


intervals  cannot  be  determined  for  either  the  ^  -+■  y2  or 


causal  effects. 


The  discussion  above  illustrates  only  one  of  many  reciprocal 
causation  models  for  the  social  sciences.  With  respect  to  psychology, 
reciprocal  causation  is  intrinsic  to  many  theoretical  models.  For 
instance,  reciprocal  causation  is  indicated  in  social  learning  and 
cognitive  social  learning  models  (cf.  Bandura,  1978;  Mahoney,  1977; 
Mischel,  1977) ,  interactional  psychology  (cf.  Bowers,  1973;  Ekehammer, 
1974;  Endler  &  Magnusson,  1976;  Pervin,  1968),  and  social  system  and 
organizational  theory  (cf.  Dansereau,  Graen,  &  Haga,  1975;  James  &  Jones, 
1976;  Roberts,  Hulin,  &  Rousseau,  1978)  .  Furthermore,  confirmatory 
analysis  provides  analytic  techniques  to  test  nonrecursive  models 
involving  reciprocal  causation.  TVo  of  the  most  popular  of  these 


techniques  were  reviewed  recently  in  the  psychological  literature.  These 
methods  are  two-stage  least  squares  (James,  1981;  James  &  Singh,  1978)  and 
maximum  likelihood  estimation  (Maruyama  &  McGarvey,  1980).  A  review  of 
these  procedures  is  beyond  the  scope  of  this  paper,  although  it  is 
noteworthy  that  Kritzer  (1977)  used  the  twa-stage  least  squares  technique 
to  test  (and  support)  the  hypothesized  reciprocal  causation  between  police 
and  protester  violence. 

lb  summarize,  each  causal  connection  in  a  structural  model  must  be 
specified  as  either  recursive  or  nonrecursive.  It  is  also  necessary  to 
stipulate  that  not  all  relations  in  a  structural  model  may  be 
nonrecursive.  This  stipulation  evolves  from  the  "identification" 
question,  which  is  concerned  with  vAiether  sufficient  information  is 
available  to  obtain  unique  mathematical  solutions  for  the  structural 
parameters  in  functional  equations  (cf.  Fisher,  1966;  Theil,  1971). 
Identif ication  is  discussed  in  Conditions  9  and  10.  At  this  time  we  note 
only  that  functional  equations  representing  nonrecursive  relations  are 
identified  if  there  exists  at  least  one  "instrument"  for  each  endogenous 
variable  involved  in  a  reciprocal  relationship.  In  general  terms,  an 
instrument  is  an  exogenous  variable  that  affects  only  one  of  the 
endogenous  variables  involved  in  a  reciprocal  relationship.  For  example, 
each  of  the  exogenous  variables  in  Figure  2.2  is  an  instrunent  because 
each  of  these  variables  is  a  cause  of  either  ^  or  y2'  but  not 
both  jyj  and 

We  now  turn  to  the  apparent  paradox  of  defining  causality  as  an 
asymmetric  functional  relation  and  then  entertaining  the  concept  of 
reciprocal  causation.  To  review  briefly,  the  stipulation  of 
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asynmetric  functional  relations  was  designed  (a)  to  avoid  confusion  of 

causation  with  logical  implication,  and  (b)  to  differentiate  functional 

relations  from  ordinary  mathematical  functions,  where,  for  example, 

2  2 

e  =  me  implies  c  =  e/m.  The  fidelity  of  the  stipulation  is 
preserved  by  stating  that  y2  =  £(Zi'  )  does  not  imply  that 

rule  out,  however,  the  offering  of  a 
causal  hypothesis  that  =  £(y2,  d^).  That  is,  hypothesizing  a 
functional  relation  does  not  automatically  imply  a  reciprocal  form  of 
relation,  but  a  reciprocal  relation  may  be  proposed  as  part  of  a 
structural  model.  For  example,  the  hypothesis  that  child  behavior  = 

_f  (mother  behavior)  does  not  imply  mother  behavior  =  f_( child  behavior) . 
However,  we  may  propose  two  functional  relations,  namely  child  behavior  = 
f (mother  behavior)  and  mother  behavior  =  fjehild  behavior) .  (Based  on 
the  preceding  discussion,  instruments  would  have  to  be  added  to  each  of  the 
functions  before  analyses  could  be  performed.) 

In  sum,  reciprocal  causation  is  amenable  to  confirmatory  analysis  and 
is  consistent  with  a  functional  definition 'of  causality.  It  is  necessary, 
however  to  amend  Simon's  (1977)  definition  of  causality  to  allow  for 
reciprocal  causation.  This  amendment  is:  "Causality  is  a  functional 
relation  among  certain  variables,  or  subsets  of  variables,  in  a 
self-contained  structure,  where  functional  relation  does  not  imply 
reciprocal  relation  but  reciprocal  relations  may  be  postulated." 


) .  This  does  not 


The  concluding  objective  of  this  section  is  to  distinguish  between 


reciprocal  causation  and  "cyclical  causation".  Reciprocal  causation 

refers  to  relations  of  the  form  ^  t  ^ ,  where  the  causal  intervals  for 

the  ^  "*■  and  ^  causal  connections  are  rapid  and 

essentially  indiscernable.  If,  however,  a  causal  interval  can  be 

established  for  the  ^  -*•  ^  and  ^  causal  connections, 

then  one  has  a  structural  model  such  as  shovel  in  Figure  2.3.  The  essence  of 

this  model  is  that  the  occurrence  of  at  time  _t  (i.e.,  causes 

the  occurrence  of  ^  at  time  t+1.  U-e*/  X2t+].)  '  where  the  time 

interval  between  t.  and  _t+l_  is  discernible.  The  occurrence  of  ^2  at 

time  t+1.  then,  through  a  feedback  loop  shown  at  the  top  of  Figure  2.3,  causes 

the  occurrence  of  ^  at  time  t+2^  (i.e.,  Y.lt+2)  *  where  again  the  time 

interval  between  tfl  and  t+2  is  discernable.  This  cycle  repeats  itself  and 

we  have  cyclical  causation. 

v  rrr  v’.vv^vv^tv',  vv  r*  v  1  v  v  vv  ■■  ■.  «*  ■*  ^  ^  >.■«  r  w  r  t  ■■■.  ■  r 

Insert  Figure  2.3  about  here 
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The  structural  model  in  Figure  2.3  is  a  form  of  time  series  design. 

Of  special  importance  is  the  fact  that  it  is  a  recursive  model  inasmuch 
as  all  causal  relations  are  asymmetric.  That  is,  each  causal  relation  is 
recursive  because  a  time  interval  may  be  specified  between  the  occurrence 
of  a  cause  and  the  occurrence  of  an  effect  (cf.  Strotz  &  Wbld,  1971) .  We 
shall  refer  to  this  design  as  a  "cyclical  recursive  model." 

The  functional  equations  for  Figure  2.3  are  shown  below,  y^  and 
are  specified  in  the  subscripts  of  structural  parameters  and  disturbance 
terms  by  a  "1"  or  a  "2"  preceding  the  time  indicator.  ^  at  time  t 
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assumes  the  role  of  an  exogenous  variable  (i.e.,  its  occurrence  is  not  to  be 
explained  by  this  model) . 


^2t+l  =  — 2t+l_,  ltXlt  +  -2t+l 
^lt+2  =  -lt+2,2tl^2t+l  +  -lt+2 
^2t+3  =  -2t+3,lt+2i:-rt+2  +  -2t+3 
^lt+4  =  —lt+4 , 2t+3^2t+3  +  — lt+4 


Linear  relations  and  deviation  scores  are  assumed.  Furthermore,  it  must 
be  assumed  not  only  that  the  causal  intervals  known,  but  also  that  the  times 
of  measurement  of  variables  representing  and  Y2  measurement 

intervals)  correspond  closely  to  the  causal  intervals.  Finally,  as  discussed 
in  greater  detail  in  Condition  7,  the  model  and  functional  equations  must  be 
"stationary".  Stationarity  refers  to  the  stability  of  a  structural  model, 
and  would  be  indicated  here  if  the  structural  parameters  with  common  causes 
and  effects  are  equal  (i.e.,  =  ®it+4,2t+3'  and  %+l,lt 

=  -2t+3,lt+2)  * 

In  conclusion,  while  there  is  much  to  be  said  in  favor  of  reciprocal 
causation  (nonrecursive  relations)  and  cyclical  causation  (cyclical  recursive 
relations)  in  psychological  research,  we  shall  continue  to  employ  simple 
recursive  models  for  illustrative  purposes. 

Condition  5u  Self-Contained  Functional  Equations 

Simon  (1977)  discussed  self-containment  in  the  context  of  an  ordered 
system  of  functional  equations  representing  a  recursive  structural  model. 
However,  an  overview  of  the  basic  logic  and  requirements  of  self-containment 


t 
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is  made  easier  by  focusing  on  a  single  functional  equation.  We  will 
discuss  self-containment  in  this  context. 


As  discussed  previously,  a  (probabilistic)  functional  equation  is 
representative  of  a  causal  relation  only  if  the  functional  equation  is 
self-contained,  and  an  equation  is  self-contained  only  if  all  relevant  causes 
of  the  endogenous  variable  are  included  in  the  functional  equation,  lb 
illustrate,  consider  the  part  of  the  model  in  Figure  2.1  which  deals  with 
causes  of  role  overload.  This  part  is  reproduced  in  Figure  2.4a.  The  linear 
functional  equation  for  role  overload  is,  in  deviation  form: 


y,  =  B  ,  x,  +  B  ,  x_ 


x0  +  (d  =  RS) 
il 


(2.9) 


Insert  Figure  2.4  about  here 


Equation  2.9  is  the  same  as  Eq.  2.1  with  the  exception  that  ci  has 

*1 

been  specified  as  being  made  up  exclusively  of  random  shocks  (RS) .  This 

specfication  implies  that  x  and  x2  ate  the  only  relevant  causes  of 

which  suggests  that  no  other  variable  exists  which  simultaneously  (a) 

has  a  direct,  nonminor  influence  on  (b)  is  stable,  (c)  is  related  to 

x^  and/or  x^,  and  (d)  is  not  linearly  dependent  on  and  x^* 

That  is,  no  other  relevant  causes  exist  for  Consequently,  x^  and 

will  not  be  related  to  d  because,  by  definition,  the  expected 
£  — JL 

value  of  the  covariance  between  a  variable  and  random  shock  is  zero 
[e.g.,  E(x.d  )  =  E(x.RS)  =  a  DC  =0].  Given  that  d 

is  unrelated  to  x^  and  x^,  it  follows  that  the  conditional  expected 
values  of  ^  are  determined  by  a  conditional  function,  or 
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i  '  2 

E(y,  x. ,x,)  =  B  x,  +  B  x0.  In  other  words,  the  equation 
i.1  A  £  1  £  ~^2^2  — 

is  self-contained  and  the  functional  equation,  and  functional 


relation  =  ^(£^'£2'— y  ^  '  ma^  use<^  to  rePresent  a  causal 

relation  (assuming  that  other  conditions  have  been  met) . 


Now,  suppose  a  relevant  cause  of  role  overload  is  not  included 
explicitly  in  the  functional  equation.  A  possible  example  of  such  a 
relevant  cause  is  the  degree  to  which  the  work  environment  is  perceived  to 
be  "impersonal".  An  impersonal  environment  suggests  that  management 
focuses  on  productivity  and  profit,  with  little  attention  given  to 
employees'  needs  and  capabilities.  By  contrast,  a  "personal"  environment 
connotes  consideration  of  employees'  needs  and  capabilities  in  addition  to 
concerns  for  productivity  and  profit.  Employees  are  predicted  to  be 
overloaded  in  impersonal  environments,  in  comparison  to  personal 
environments,  because  managanent  in  such  environments  strives  for  economic 
and  performance  goals  even  after  the  personal  capacities  of  employees  have 
been  exceeded.  Thus,  "impersonality"  might  be  hypothesized  to  be  a  major 
cause  of  role  overload. 


Another  important  characteristic  of  a  relevant  cause  is  that  it 
covaries  with  a  causal  variables) included  explicitly  in  a  functional 
equation.  Impersonality  is  likely  to  covary  with  both  expected  quality  and 
quantity  of  role  performance  because  managers  who  focus  only  on 
productivity  and  profit  are  more  likely  to  demand  higher  levels  of  quality 
and  quantity  of  performance  than  managers  more  attuned  to  the  effects  of 
their  demands  on  employees.  Cn  the  other  hand,  impersonality  is  not 
expected  to  be  redundant  with,  or  linearly  dependent  on,  quality  and 
quantity  of  role  performance.  In  other  words,  impersonality  might  be 
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viewed  as  having  a  unique  effect  on  role  overload. 

A  model  including  impersonality  is  displayed  in  Figure  2.4b.  The 
unmeasured^  (impersonality)  is  represented  in  the  disturbance  term  for 
along  with  the  RS  component.  It  is  assumed  that  a  fits  all  the 
criteria  for  a  relevant  cause  of  including  relationships  with 
and  x-  (indicated  by  the  curved,  double-headed  arrows  between  a  and 
both  _x1  and  x^) ,  but  not  linear  dependence  on  x^  and  x2.  The 
curved,  double-headed  arrows  also  indicate  that  the  relationships  between  a 
and  the  >£' s  are  not  to  be  explained  by  this  model. 

The  functioned  equation  for  Figure  2.4b  is 


%  ’  *  \i2-l  *  (%  "  5§  +  S>  <2' 10> 

The  curved  (double-headed)  arrows  relating  a  to  x ^  and  x2  in  Figure 
2.4b  show  that  both  x.  and  x -  should  covary  with  d  .  Hie  expected 

—  —  h 

value  of  the  covariance  between  x  and  d  is: 

-  ^1 

E(x,d  )  =  E  [x,  (RS  +  a)  ] , 

£ 

=  E  (x^RS)  +•  E  ( x-^a)  , 


(2.11) 


where  cr  is  a  (population)  covariance.  A  similar  derivation 

demonstrates  covariation  between  and  d  . 

-  *1 

Assuming  linearity  and  additivity,  covariation  between  d  and 

^1 

x.  and  x_  brought  about  by  inclusion  of  a  in  d  implies  that 
-  —  *1 

the  functional  equation  (Eq.  2.10),  and  the  model  from  which  it  wbs  derived. 
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are  not  self-contained,  lb  be  specific,  the  model  in  Figure  2.4b  is  not 
self-contained  because  the  conditional  expected  values  of  ^  are  no 
longer  determined  by  the  conditional  function  B  x  +  8  x?. 

iA  -  T-l-2  - 

To  achieve  self-containment,  not  only  should  this  function  include  a  term  for 

a  (i.e.,  ^a)  ,  but  also  the  structural  parameters  for  x^  and 

x_  should  include  controls  for  a  (e.g.,  B  ) .  Thus, 

with  £  unmeasured,  Eq.  2.10  is  misspecified  because  it  omits  a  relevant 

cause.  In  fact,  Eq.  2.10  is  not  a  functional  equation  because  it  is  not 

self-contained.  Consequently,  neither  the  equation  nor  the  functional 

relation  represents  a  causal  relation. 

In  summary,  the  condition  of  self-containment  is  satisfied  only  if  the 
causes  included  explicitly  in  a  functional  equation  are  unrelated  to  the 
disturbance  term  of  that  equation.  Pragmatically,  failure  to  satisfy  this 
condition  results  in  biased  estimates  of  structural  parameters,  which  is 
demonstrated  in  Condition  9.  Of  importance  here  is  the  fact  that  many 
published  confirmatory  studies  in  psychology  have  paid  no  attention 
whatever  to  *-he  need  for  self-contained  equations  (and  systems)  (Billings  & 
Wroten,  1978;  James,  1980;  Cliff,  1980).  Review  of  these  studies  reveals 
obvious  and  serious  violations  of  the  self-contairsnent  condition. 


Consequently,  it  is  our  opinion  that  the  confirmatory  literature  in 
psychology  is  inundated  with  biased  estimates  of  structural  parameters,  or, 
if  you  wish,  equations  that  have  only  remote  bearing  on  causal  relations. 
This  is  rather  difficult  to  understand  inasmuch  as  almost  all  introductory 


texts  on  confirmatory  analysis  highlight  the  need  to  include  relevant  causes 
in  functional  equations,  although  the  terminology  and  format  used  to  present 
this  condition  vary.  For  example,  the  condition  is  discussed  in  terms  of 
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covariation  between  causes  and  disturbance  terms  (Duncan,  1975;  Johnston, 
1972),  covariation  among  disturbance  terms  (Namboodiri  et  al.,  1975),  and 
nonspur iousness  (Kenny,  1979). 

This  problem  deserves  immediate  attention.  Unfortunately,  such 
attention  is  unlikely  to  result  in  total  resolution  of  the  problem  because 
all  relevant  causes  of  endogenous  variables  are  not  likely  to  be  known 
(Duncan,  1975;  Heise,  1975;  Kenny,  1975).  As  noted  by  James  (1980,  p. 

415) ,  "The  operative  question  is  not  whether  one  has  an  unmeasured 
variables  problem  but  rather  the  degree  to  which  the  unavoidable 
unmeasured  variables  problem  biases  estimates  of  path  coefficients  [a  form 
of  structural  parameter]  and  provides  a  basis  for  alternative  explanations 
of  results."  This  statement  was  grounded  on  the  logic  that  (a)  scientific 
investigation  must  proceed  based  on  what  is  known  at  the  present  time, 
recognizing  that  present  knowledge  is  incomplete;  (b)  one  must  judge 
whether  enough  information  has  accumulated  to  justify  both  the  development 
of  a  structural  model  and  the  conduct  of  a  confirmatory  analysis;  and  (c) 
all  known  relevant  causes  should  be  included  in  a  model  and  its 
functional  equations.  In  other  words,  given  that  one  believes  that  it  is 
reasonable  to  proceed  with  confirmatory  analysis,  then  thr  pragmatic 
question  is  whether  known  relevant  causes  are  included  in  the  functional 
equations.  In  this  regard,  it  is  fair  to  ask  whether  prior  failures  to 
include  relevant  causes  in  structural  models  and  functional  equations 
could  have  been  avoided  by  not  only  an  awareness  of  the  self-containment 
condition,  but  also  by  more  thoughtfully  developed  structural  models.  We 
recommend  that  investigators  submit  their  models  to  their  own  and  others' 


careful  scrutiny,  attempting  to  identify  unmeasured  relevant  causes, 
before  data  are  collected.  In  the  discussion  of  Condition  9  we  present  a 
set  of  decision  steps  that  assist  in  evaluating  the  likelihood  that 
unmeasured  relevant  cases  will  bias  estimates  of  structural  parameters. 

At  that  time  we  will  also  discuss  the  use  of  a  time  series  design  that 
makes  possible  the  minimization  of  bias  in  estimates  of  structural 
parameters  created  by  unmeasured  relevant  causes. 

Condition  6 :  Specification  of  Boundaries 

Boundaries  specify  the  contexts  (e.g.,  types  of  subjects  and 
environments)  within  which  functional  relations  are  expected  to 
generalize.  Given  linearity  in  both  parameters  and  variables,  this 
condition  is  satisfied  if  the  functional  equation  relating  an  effect  to 
its  causes  is  not  contingent  on  other  variables  (Kenny,  1979)  .  In  other 
words,  the  functional  relation  should  be  additive.  The  condition  is 
violated  if  the  functional  relation  is  contingent  on,  or  moderated  by, 
the  values  on  a  third  or  more  variables.  This  is  because  a  significant 
moderator  effect  implies  interaction,  from  vtfiich  it  follows  that  direct 
causal  connections  of  the  form  x  cannot  be  interpreted 
unanbiguously . 

Tests  for  interaction/moderation  should  be  conducted  as  part  of  a 
confirmatory  analysis  vrtien  prior  research  and/or  theory  suggest  the 
presence  of  a  moderator.  If  the  potential  moderator  is  a  categorical 
variable,  then  the  values  of  the  structural  parameters  may  be  estimated 
separately  for  each  value  of  the  moderator,  such  as  one  analysis  for  males 
and  another  analysis  for  females.  Tests  for  moderation  are  then  often 


♦ 


based  on  an  assessnent  of  whether  the  estimates  of  the  structural 


parameters  vary  as  a  function  of  the  "subgroups"  established  by  different 
values  of  the  moderator  (cf.  Schoenberg,  1972;  Joreskog  &  Sorbom,  1979; 
Specht  &  Warren,  1976) .  The  term  subgroup  refers  to  subpopulations 
(subsamples)  in  which  all  relationships  are  linear  and  additive  (cf. 
Zedeck,  1971) .  If  the  moderator  (or  moderators)  is  a  continuous  variable 
(e.g.,  age),  then  more  complex  analytic  procedures  are  required.  In 
general,  these  procedures  assune  a  form  similar  to  moderated  regression 
(cf.  Cohen  &  Cohen,  1975) ,  although  more  sophisticated  methods  are 
required  for  causal  interpretation.  An  excellent  and  readable  discussion 
of  these  methods  was  presented  recently  by  Stolzenberg  (1979)  .  Methods 
for  estimating  structural  parameters  given  nonlinearity  in  the  variables 
are  also  presented  by  Stolzenberg. 

In  summary,  our  primary  concern  here  is  to  point  out  that  the 
existence  of  a  moderator,  or  moderators,  requires  careful  specification  of 
the  boundaries  of  a  structural  model.  Taking  the  simple  case  where  a 
moderator  is  a  categorical  variable,  a  different  structural  model  and 
accompanying  functional  equations  are  required  for  each  value  of  the 
moderator  (i.e.,  each  subgroup).  In  other  words,  each  value  of  the 
moderator  forms  the  boundary  for  each  structural  model.  Clearly, 
moderators  inflate  the  complexity  of  confirmatory  analysis,  and  attempts 
should  be  made  to  specify  the  original  boundaries  of  a  structural  model  in 
a  manner  that  reduces  the  likelihood  of  moderation.  Conversely,  failure 
to  identify  a  significant  moderator  will  result  in  inaccurate  estimates  of 
structural  parameters  and  erroneous  causal  inferences  regarding  the 
magnitudes  of  causal  effects. 

Condition  7:  Stability  of  the  Structural  Model 
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A  structural  model  is  expected  to  be  stable.  Stability  is  indicated 
if  the  values  of  structural  parameters  in  functional  equations  are 
invariant,  or  stationary,  over  specified  time  intervals  (cf.  Pindyck  & 
Rubinfeld,  1976).  The  importance  of  stability  becomes  clear  if  its 
obverse  is  considered.  If  the  variables  in  a  structural  model  are  in  a 
state  of  flux,  then  the  relationships  among  the  variables  are  likely  to 
change  rather  rapidly.  Consequently,  data  collected  at  two  points  in 
time,  within  a  relatively  short  time  interval,  would  provide  two  different 
sets  of  equations  for  the  same  set  of  variables;  that  is,  the  estimated 
structural  parameters  would  assume  different  values  at  each  point  in  time. 
This  is  an  impossible  situation  because  the  structural  model  could  not  be 
generalized  beyond  a  few  days  or  weeks.  On  the  other  hand,  if  the 
parameter  estimates  remain  the  same  over  time  periods  that  are  reasonable 
and  meaningful  for  the  variables  of  interest,  then  a  basis  is  provided  for 
generalizing  the  results  of  a  confirmatory  analysis  beyond  very  restricted 
time  intervals.  Note,  however,  that  the  stability  condition  does  not 
suggest  that  a  particular  structural  model  will  be  invariant  with  respect 
to  long  periods  of  time.  The  point  is  that  a  structural  model  should 
possess  enough  stability  for  generalization  across  a  reasonable  and 
meaningful  time  interval  (cf.  Simon,  1977). 

Equilibrium- type  Condition 

A  subtle  but  critical  implication  of  the  stability  condition  is  that 
the  values  on  the  variables  in  a  functional  equation  should  have  reached  a 
temporary  state  of  approximate  constancy  before  an  attempt  is  made  to 
measure  them  and  to  use  the  data  to  estimate  structural  parameters.  This 
temporary  state  of  constancy  is  referred  to  as  the  equil ibr inn- type 


condition  (Namboodiri  et  al.,  1975).  The  equilibrium type  condition  is 
almost  unheard  of  in  the  psychological  literature,  a  literature  which 
relies  heavily  on  cross-sectional  designs  in  confirmatory  analysis. 

Stated  directly,  it  is  precisely  the  assumption  that  the 
equilibrium-type  condition  has  been  satisfied  that  justifies  using 
cross-sectional  designs  to  estimate  structural  parameters.  Specifically, 
cross-sectional  designs  attempt  to  model  (capture)  causal  processes  that 
have  already  occurred  via  functional  equations  that  employ  fixed  constants 
(i.e.,  structured  parameters)  (Pindyck  &  Rubinfeld,  1976).  Estimation  of 
the  fixed  constants  is  justified  only  in  the  condition  that  the  effects  of 
the  exogenous  variables  on  the  endogenous  variables  have  worked  their  way 
through  the  causal  system.  This  means  that  subjects'  scores  on  the 
endogenous  and  exogenous  variables  are  temporarily  fixed,  or 
"equilibrated"  (Heise,  1975) ,  or  at  least  any  effect  which  does  occur 
during  the  equilibrium  period  is  so  rapid  that  a  temporary  equilibrium  is 
rapidly  reestablished  (Simon,  1977) .  Given  an  equilibrium-type  condition, 
in  combination  with  (a)  sampling  that  has  been  of  sufficient  breadth  to 
guarantee  large  differences  among  subjects  on  each  of  the  variables,  and 
(b)  functional  equations  that  are  appropriate  for  all  subjects  in  the 
population  from  vtoich  a  random  sample  was  drawn,  one  may  model  prior 
causal  processes  by  relying  on  cross-sectional  comparisons  across  subjects 
to  infer  processes  that  have  been  at  work  for  a  particular  subject  over 
time  (Heise,  1975;  Miller,  1971;  Namboodiri  et  al.,  1975). 

Consider,  for  example,  the  structural  model  for  role  overload  and 
state  anxiety  (Figure  2.1 — which,  for  illustrative  purposes,  is  considered 
self-contained)  .  The  preceding  discussion  suggests  that,  for  a 
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cros&-sectional  design,  one  would  want  to  sample  work  environments  and 
individuals  so  as  to  guarantee  large  variations  in  expected  quality  and 
quantity  of  role  performance,  perceptions  of  role  overload,  and  state 
anxiety.  Data  should  be  collected  only  after  the  effects  of  role 
expectations  on  role  overload,  and  the  effects  of  role  overload  on  state 
anxiety,  have  had  a  chance  to  stabilize  (equilibrate)  in  each  work 
environment  and  for  each  subject;  that  is,  only  after  values  on  the 
variables  have  reached  a  temporary  state  of  constancy.  Assuming  the 
functional  relations  are  applicable  for  all  subjects  and  work  environments 
(see  Condition  6) ,  analytic  procedures  discussed  in  Conditions  9  and  10 
could  then  be  used  to  capture  the  prior  causal  processes  that  generated 
the  particular  configuration  or  pattern  of  values  on  the  variables 
obtained  in  the  cross-sectional  analysis. 

An  equilibrium-type  condition  is  also  assumed  for  each  wave  (time)  of 
measurement  in  a  time  series  analysis.  However,  the  temporary  equilibrium 
at  a  particular  time  is  lost  as  soon  as  changes  occur  in  one  or  more 
causal  variables.  The  changes  in  causal  variables  produce  changes  in 
effects,  and  a  new  temporary  equilibrium  is  established  \«hen  the  values  on 
all  variables  reach  a  new  temporary  state  of  constancy.  This  is  referred 
to  as  a  "shift  in  equilibrium  levels"  (cf.  Namboodiri  et  al.,  1975). 
Given  changes  in  a  causal  variables,  the  time  period  required  to 
reestablish  a  temporary  equilibrium  in  an  effect  is  referred  to  as  the 
"equilibration  time"  (Heise,  1975),  which  is  the  causal  interval.  (It  is 
assumed  that  the  values  of  causes  remain  constant  within  the  causal 
interval.)  If  (a)  the  functional  relations  are  stationary  (i.e.,  the 
structural  parameters  are  invariant  with  respect  to  shifts  in  equilibrium 
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levels)  and  (b)  times  of  measurement  correspond  closely  to  causal 
intervals,  then  (c)  it  is  possible  to  attempt  to  fit  functional  equations 
to  the  shifts  in  equilibria  levels.  Only  then  can  we  use  these 
functional  equations  to  infer  the  causal  processes  that  produced  the 
shifts  (cf.  Namboodiri  et  al.,  1975). 

Tb  summarize,  the  stability  condition  is  satisfied  if  the  values  on 
variables  have  reached  a  temporary  state  of  constancy  for  each  time  of 
measurement  (equilibrium- type  condition)  and  the  functional  equations  are 
the  same  over  specified  time  intervals  (stationarity)  .  These  points  apply 
to  recursive  models,  including  cyclical  recursive  models.  They  apply  also 
to  nonrecursive  models,  although  the  process  by  which  the  endogenous 
variables  involved  in  dynamic,  reciprocal  relationships  reach  an 
approximate  state  of  equilibrium  requires  advanced  mathematics.  A 
relatively  nontechnical  overview  of  this  process  is  presented  in  Heise 
(1975)  and  Namboodiri  et  al.  (1975) 

Eguilibriim-tyipe  conditions  and  causal  order. 

It  was  suggested  in  Condition  3  that  the  causal  ordering  among 
variables  is  often  less  than  obvious  and  subject  to  misspecification. 
Consider,  as  an  example,  the  popular  "social  systems"  concept  (cf.  Katz  & 
Kahn,  1978),  vhich  in  many  areas  of  psychology  gives  rise  to  serious 
concerns  about  causal  ordering  because  all  variables  in  the  social 
system  are  regarded  as  causally  related  to  each  other,  directly  or 
indirectly,  we  illustrate  this  by  the  simple,  nonelaborate  model  showi  in 
Figure  2.5a.  This  model  predicts  that  increases  in  organizational  control 
processes  (e.g.,  implementation  of  weekly  time  and  effort  statements) 
cause  increases  in  anployee  dissatisfaction.  These  in  turn  lead  to 
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increases  in  dysfunctional  behaviors  (e.g.,  absenteeism,  clock-watching, 
work  slowdowns)  .  Etysfunctional  behaviors  may  then  serve  as  a  stimulus  for 
even  greater  organizational  controls,  which  lead  to  greater 
dissatisfaction,  and  so  on,  so  that  an  iterative,  recurring  cycle  of 
control  -*■  dissatisfaction  -*■  dysfunctional  behaviors  -*■  control  is 
established.  Note  that  the  point  at  which  the  cycle  begins  is  arbitrary; 
the  cycle  might  also  have  started  with  employee  dissatisfaction  or 
dysfunctional  behaviors. 

This  model  does  not  present  a  major  problem  for  a  time  series  design. 
In  this  type  of  design,  the  feedback  processes  could  be  captured  by  a 
cyclical  recursive  model,  beginning  at  an  arbitrary  point  and,  for 
example,  obtaining  two  measures  of  each  variable,  where  time  of 
measurement  corresponds  to  causal  intervals  (equilibration  times) .  Ch  the 
other  hand,  this  model  creates  a  major  problem  for  cross-sectional  designs 
because  once  the  cycle  is  operative,  the  causal  relationships  are  in  a 
system  of  infinite  regress  and  specification  of  causal  order  is  arbitrary. 


Insert  Figure  2.5  about  here 


Translated  into  the  terminology  of  structural  models,  this  reasoning 
implies  that  all  variables  are  endogenous  in  a  cross-sectional 
application  of  Figure  2.5a.  Confirmatory  analysis  cannot  proceed  in  such 
an  ambiguous  situation.  It  is  necessary  to  identify  predetermined, 
exogenous  variables  that  are  not  caused  by  endogenous  variables,  and  to 
use  these  predetermined  variables  to  establish  causal  precedence.  In 
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pursuit  of  this  objective,  Miller  (1971)  reasoned  that  even  though  all 
variables  in  a  structural  model  may  be  related,  particularly  over  long 
periods  of  time,  it  is  possible  that  the  effects  of  some  variables  on 
others  will  be  so  small  or  infrequent  as  to  be  negligible  in  a  particular 
interval  in  time.  Consequently,  within  a  specific  time  interval  that  is 
relatively  short  but  general izable,  it  is  possible  to  establish  a  causal 
order  for  cross-sectional  designs  by  specifying  that  (a)  causes  must  have 
at  least  a  moderate  influence  on  effects  within  the  time  interval,  and  (b) 
a  variable  whose  causal  influence  is  slow  and  thus  does  not  occur  within 
the  bounds  of  the  time  interval  should  be  treated  only  as  an  effect. 

For  example,  in  the  illustration  in  Figure  2.5  one  might  postulate 
that  the  effects  of  organizational  control  on  dissatisfaction,  and 
dissatisfaction  on  dysfunctional  behaviors,  occur  much  more  rapidly  than 
the  effects  of  dysfunctional  behaviors  on  organizational  control.  This  is 
reasonable  given  that  affective  and  behavioral  outcomes  are  individually 
determined  and  may  occur  quickly,  whereas  changes  in  organizational 
control  may  require  multiple  inputs  from  different  line  managers  and  staff 
(e.g.,  legal  opinions),  deliberations  regarding  alternatives,  and  time  to 
implement  formal  decisions.  Consequently,  a  time  interval  could  be 
identified  in  vrfiich  control  influences  dissatisfaction  and  then 
dissatisfaction  influences  behaviors,  but  behaviors  do  not  significantly 
influence  control.  Thus,  dysfunctional  behaviors  could  be  treated  only  as 
an  effect.  That  is,  we  expect  the  influence  of  dysfunctional  behaviors  on 
organizational  control  to  take  a  period  of  time  greater  than  that  bounded 
by  the  time  interval.  It  must  be  emphasized,  however,  that  the  model 
could  not  be  generalized  beyond  the  specified  temporal  bound. 
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In  effect,  the  preceding  argument  rests  on  establishing  a  causal  order 
using  differences  in  equilibration  times  and  assumptions  regarding  strengths 
of  causal  influences  in  a  short  but  general izable  equilibrium- type  condition. 
Tt>  illustrate,  consider  Figure  2.5b,  which  displays  the  presumed  causal 
ordering  in  the  time  interval  extending  from  to  t^,  and  an 
equilibrium- type  condition  extending  from  time  t^  to  time  _t^> 

Organization  control  is  assumed  to  have  stabilized  (equilibrated)  at  time 
t^,  and  to  remain  constant  until  time  t^.  The  interval  between 
t^  and  t.^  is  the  equilibration  time  required  for  employee 
dissatisfaction  to  stabilize;  that  is,  the  causal  interval  for  the 
organizational  control  employee  dissatisfaction  causal  effect.  Once 
employee  dissatisfaction  has  stabilized,  it,  like  control,  is  expected  to 
remain  constant  through  at  least  time  t^.  The  time  interval  between 
t^  and  t2  is  the  equilibration  time  for  the  employee  dissatisfaction  -*■ 
dysfunctional  behaviors  causal  influence.  Finally,  the  period  encompassed  by 
t2  to  _t3  is  the  equilibrium-type  condition,  where  the  values  on  all 
three  variables,  including  dysfunctional  behaviors,  are  expected  to  remain 
relatively  constant.  This  is  the  time  period  within  which  data  should  be 
collected  for  a  cross-sectional  analysis. 

The  equilibrium- type  condition  will  end  at  the  time  that  values  on 
the  variable  representing  organizational  control  begin  to  change  as  a 
result  of  the  causal  influences  of  dysfunctional  behaviors.  However,  the 
equilibration  time  for  the  dysfunctional  behaviors  -*■  organizational 
control  causal  relation  is  expected  to  be  substantially  longer  than  the 
time  period  extending  from  tg  to  t3.  This  implies  that  (a)  the 
equilibration  times  represented  by  t^  to  and  t^  to  t2  are 
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much  more  rapid  (i.e.,  shorter)  than  the  equilibration  time  for  the 
dysfunctional  behaviors+organizational  control  causal  influence,  and  (b) 
given  the  slow  dysfunctional  behavio  ^organizational  control  causal  effect, 
values  on  the  variables  may  be  regarded  as  equilibrated  during  the 
equilibrium- type  condition  (_t2  to  _t^)  • 

Tb  summarize,  in  this  subsection  we  have  shown  how  the  equilibrium- 
type  condition  is  useful  when  theorizing  about  causal  ordering  in 
cross-sectional  designs.  Salient  implications  of  using  differences  in 
equilibration  times  and  assumed  lengths  of  equilibrium- type  conditions  to 
predict  causal  orderings  are  (a)  "time"  is  an  important  consideration  in 
cross-sectional  designs;  (b)  researchers  must  be  specific  about 
assumptions  regarding  time,  especially  the  general izability  of  a  presumed 
causal  ordering  in  regard  to  time;  and  (c)  different  assumptions  regarding 
time  may  lead  bo  different  causal  orderings.  On  the  other  hand, 
assumptions  regarding  time  are  less  demanding  in  cross-sectional  designs 
than  in  time  series  designs.  Fbr  example,  one  may  "get  by"  with  only 
assumptions  regarding  "relative  differences"  in  equilibration  times  in 
cross-sectional  designs,  whereas  time  series  analysis  requires  specific 
knowledge  (or  predictions)  of  actual  causal  intervals. 

Summary 

This  completes  the  discussion  of  the  seven  conditions  pertaining  to 
the  appropriateness  of  theoretical/structuoral  models  for  confirmatory 
analysis.  The  seven  conditions  are  summarized  in  Figure  2.6.  It  must  be 
stressed  that  lack  of  reasonable  satisfaction  of  one  or  more  of  these 
conditions  results  in  questionable  use  of  confirmatory  analysis.  We  shall 
continue  to  emphasize  this  point  in  the  remaining  presentations  in  this 


text,  where  false  conclusions  regarding  empirical  confirmation  or 
disconfimation  are  shown  to  result  from  violation  of  aspects  of 
Conditions  1  thrugh  7,  as  well  as  Condition  8  (operationalization  of 
constructs) . 


Insert  Figure  2.6  about  here 


We  now  turn  to  a  review  of  the  conditions  pertaining  to  the 
operational  aspects  of  confirmatory  analysis.  Because  these  operational 
aspects  are  the  subject  of  a  rapidly  growing  literature  in  psychology,  not 
to  mention  large  literatures  in  areas  such  as  econometrics  and  sociology, 
the  presentation  emphasizes  the  logic  of  confirmatory  analytic  procedures. 
References  are  suggested  for  more  extensive  treatments  of  methods. 

Conditions  Pertaining  to  Operational  Aspects  of  Confirmatory  Analysis 

Condition  8 :  The  Operationalization  of  Variables 

We  have  used  the  term  "variable"  to  refer  to  sets  of  classes  of  events 
formed  because  the  classes  of  events  have  some  attribute  in  common.  For 
exanple,  we  may  form  a  class  of  events  from  all  those  things  weighing 
pounds.  If  we  have  several  such  classes,  each  associated  with  a 
different  weight,  then  we  may  join  these  classes  together  to  provide  a  set 
of  classes  of  events,  all  of  which  have  the  common  attribute  weight.  This 
set  will  provide  the  basis  for  the  variable  "weight".  Now,  operational¬ 
ization  of  a  variable  means  specifying  the  operations  by  which  we  would 
seek  to  assign  a  specific  event  (thing)  occurring  in  the  world  to  one  of 


64 

the  classes  of  events  of  a  variable,  where  the  events  in  each  class 
share  a  common  value  on  a  scale  in  which  all  classes  of  events  have  a 
common  attribute. 

An  important  aspect  of  operationalization  is  measurement,  which 
concerns  the  assignment  of  numbers  to  classes  of  events  so  that 
relationships  among  the  numbers  correspond  to  empirical  relationships  among 
the  classes  of  events.  Measurement  is  not,  however,  merely  the  assigrment 
of  numbers  to  classes  of  events  of  a  variable.  The  relations  among  the 
assigned  numbers  must  correspond  to  an  already  established  empirical 
relationship  among  the  classes  of  events.  A  common  but  grave  mistake  is  to 
assume  that  by  assigning  numbers  in  an  arbitrary,  or  at  best  not  well 
understood  way,  one  has  achieved  measurement.  The  mistake  is  especially 
compounded  if  one  believes  there  will  be  magical  mathematical  techniques, 
such  as  exploratory  factor  analysis  or  multidimensional  scaling,  that  will 
salvage  clear  empirical  meaning  from  the  numbers. 

When  numbers  have  been  assigned  to  the  classes  of  events  of  a 
variable,  we  then  have  a  quantitative  variable.  Moreover,  when  the 
numbers  assigned  to  classes  of  events  represent  meaningful  empirical 
relationships  among  the  classes,  then  we  may  give  empirical  meaning  to 
quantitative  relationships  between  one  measured  variable  and  another.  In 
particular,  we  will  be  interested  in  establishing  functional  relationships 
between  variables  by  employing  quantitative  variables  to  represent 
variables  in  functional  equations,  lb  do  this,  we  must  presume  in  our 
applications  that  the  scientist  has  quantitative  variables  with  at  least  an 
interval  level  of  measurement.  Thus,  for  the  purposes  of  discussion,  we 
will  assume  that  all  forms  of  variables  are,  or  may  be,  represented  as 
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quantitative  variables  with  interval  scales.^  Consequently,  the  term 
quantitative  is  not  specified  explicitly. 

The  preceding  discussion  of  operationalization  of  variables  and 
measurement  focused  on  manifest  variables,  which  were  defined  earlier  as 
variables  associated  with  directly  observable  events.  The  empirical 
content  of  a  manifest  variable  is  reflected  directly  by  assigning 
observable  events  to  values  on  the  measurement  scale  of  the  variable.  We 
also  introduced  earlier  the  term  latent  variable,  which  was  defined  as  an 
abstract  or  theoretical  construct  associated  with  presumed,  but  not 
directly  observable,  events.  Latent  variables  cannot  be  measured  directly, 
rather  they  derive  their  empirical  content  through  linkages  with  the 
directly  measurable  manifest  variables.  A  latent  variable  is  usually 
portrayed  as  a  ccmmon  factor  which  acts  as  a  cause  of  one  or  more  manifest 
variables  and  whose  empirical  content  can  only  be  assessed  indirectly  by 
examining  empirical  relations  among  manifest  variables.  A  latent  variable 
is  operationalized  in  the  sense  that  attempts  are  made  to  articulate  not 
only  linkages  between  hypothetical  constructs  and  manifest  variables,  but 
also  linkages  among  hypothetical  constructs.  Both  of  these  attempts  to 
articulate  a  latent  variable  are  dependent  on  empirical  relations  among 
manifest  variables. 

When  each  theoretical  construct  (latent  variable)  is  represented  by  a 
single  corresponding  manifest  variable  in  a  structural  model,  the  model 
is  referred  to  as  an  observed  or  manifest  variable  design.  In  this 
case,  each  manifest  variable  is  used  as  a  direct  surrogate  of  a  thoeretical 
construct,  and  relationships  among  the  manifest  variables  are  used  as 
direct  surrogates  of  relationships  among  the  constructs.  'When  manifest 


variables  are  used  in  this  manner,  it  is  critical  that  the  variables  be 
accurate  representations  of  the  constructs.  Accurate  representations  are 
indicated  vrfaen  (a)  the  manifest  variables  are,  technically,  perfectly 
reliable,  although  high  reliability  is  generally  considered  sufficient  for 
reasonable  satisfaction  of  this  condition  (cf.  Duncan,  1975);  and  (b)  the 
manifest  variables,  and  relationships  among  the  manifest  variables,  are  not 
subject  to,  or  at  least  are  minimally  influenced  by,  nonrandcm  measurement 
errors  introduced  by  extraneous,  unmeasured  influences  or  improper 
measurement  procedures.  Included  in  the  broad  category  of  nonrandcm 
measurement  errors  are:  (a)  aggregation  and  disaggregation  bias  (cf. 
Borgatta  &  Jackson,  1980;  Firebaugh,  1978;  Hannan,  1971a,  1971b);  (b) 
ceiling  and  floor  effects  in  measurement  scales  (cf.  Carroll,  1961);  (c) 
classification  errors,  such  as  reducing  a  psychometr ically  reliable  and 
valid  continuous  scale  to  a  dichotomy  (cf.  Namboodiri  et  al.,  1975);  (d) 
method  variance,  which  suggests  that  covariation  among  manifest  variables 
representing  different  constructs  is  spuriously  inflated  due  to  a  common 
measurement  procedure  (cf.  Campbell  &  Fisk,  1959;  Cronbach  &  Meehl,  1955); 
and  (e)  serially  correlated  errors  of  measurement  resulting  from  the  use  of 
the  same  measurement  scale(s)  in  two  or  more  waves  of  data  collection  (cf. 
Werts,  Linn  &  JOreskog,  1971). 

Psychologists  are  generally  aware  of  the  basic  tenets  of  scale 
development,  reliability,  and  most  of  the  concerns  associated  with 
nonrandcm  measurement  errors.  They  are  also  aware  of  the  fact  that  it  is 
extremely  difficult,  and  at  times  impassible,  to  develop  measurement 
instruments  that  encompass  only  snail  amounts  of  measurement  error  and  are 
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free  of  nonrandom  measurement  errors.  Nevertheless,  while  some  slippage 
is  allowed  in  meeting  assumptions  regarding  random  and  nonrandom 
measurement  errors  in  confirmatory  analysis  with  manifest  variables,  the 
degree  of  slippage  allowed  is  comparatively  smaller  than  psychologists 
have  enjoyed  in  exploratory  (e.g.,  correlational)  analysis. 

The  reason  for  insisting  on  more  rigorous  psychometric  and 
statistical  criteria  is  that  the  investigator  is  playing  for  higher  stakes 
in  confirmatory  analysis.  Because  the  aim  of  confirmatory  analysis  is 
causal  inference  and  explanation,  confirmatory  analysis  must  be  based  on  a 
firmer  psychometric  and  statistical  base  than  is  often  found  in 
exploratory  studies.  Thus,  for  example,  "reasonable  satisfaction"  of  the 
reliability  assumption  in  a  confirmatory  analysis  on  manifest  variables 
still  requires  high  reliabilities.  While  available  research  does  not 
allow  for  an  unequivocal  specification  of  "high",  it  is  also  the  case  that 
rationale  such  as  "a  reliability  of  .70  was  sufficient  for  exploratory 
purposes"  is  unacceptable  because  a  reliability  of  .70  may  result  in 
serious  attenuation  of  parameter  solutions  or  estimates.  Furthermore, 
attenuation  in  parameter  solutions  and  estimates  is  only  one  of  many 
possibilities  when  a  functional  equation  involves  multiple  causal 
variables,  each  of  which  involves  rand cm  measurement  error  (e.g.,  the  bias 
may  be  upward  and  signs  may  be  reversed — cf.  Blalock,  Wells,  &  Chrter, 
1970;  Kenny,  1979).  In  this  regard,  Kenny  (1379)  suggested  that  in  the 
multivariate  case  the  bias  due  to  measurement  error  may  be  negligible  if 
(a)  reliabilities  are  high,  (b)  the  (true)  causal  effects  are  snail, 
and/or  (c)  the  causal  variables  have  low  intercorrelations.  We  suggest 
that  researchers  focus  their  attention  on  option  (a) . 


The  implication  of  the  need  for  rigorous  standards  regarding 
operationalization  of  manifest  variables  is  that  the  use  of  confirmatory 
procedures  may  be  restricted  for  manifest  variable  designs  in  psychology. 

We  illustrate  this  point  in  the  introduction  of  Section  4  (latent 
variables)  by  demonstrating  the  bias  in  estimates  of  structural  parameters 
resulting  from  random  measurement  errors  in  observed  variables.  In 
Section  4  we  also  danonstrate  how  one  can  proceed  with  confirmatory 
analysis  given  some  randan  measurement  error  in  manifest  variables  and 
certain  types  of  nonrandan  measurement  errors  (e.g.,  presence  of  method 
variance,  serially  correlated  measurement  errors)  if  one  uses  a  latent 
variable  form  of  analysis.  However,  for  the  present  we  will  introduce 
and  overview  the  logic  of  confirmation  and  disconfirmation  using  manifest 
variable  designs. 

Condition  9:  Empirical  Confirmation  of  Predictions  _I:  Empirical  Support  for 
Functional  Equations 

If  Conditions  1  through  7  are  considered  reasonably  satisfied,  and  if 
manifest  variables  are  reasonably  accurate  representations  of  the 
constructs  they  are  designed  to  measure,  then  it  is  possible  to  proceed 
with  confirmatory  analysis  using  manifest  variables.  The  objective  of 
such  an  analysis  is  to  confirm  or  disconfirm  a  structural  model.  This  is 
the  process  that  is  generally  thought  of  as  confirmatory  analysis,  where 
confirmation  implies  that  a  structural  model,  and  the  functional 
relations  and  equations  representing  the  model,  are  useful  for  making 
causal  inferences  to  explain  how  variables  occur  and  why  they  covary 
(excluding  purely  exogenous  variables)  .  Disconfirmation  implies  that 


the  structural  model  (functional  relations  and  equations)  are  not  useful 
in  this  regard.  Given  linear  functional  equations,  the  logical  foundation 
of  confirmation  and  disconfirmation  is  often  viewed  in  terms  of  the 
following  three  principles. 

1.  The  functional  relations  and  equations  relating  effects  to 
causes  in  a  linear  structural  model  may  be  used  to  derive 
a  set  of  predictions  regarding  the  observed  correlations 
(or  variances-covariances)  among  the  manifest  variables. 

2.  A  structural  model  is  confirmed  if  the  predictions  regarding 
correlations  (variances-covariances)  among  manifest  variables 
are  consistent  with  the  observed  (i.e.,  empirically  derived) 
correlations  (variances-covariances)  among  manifest  variables. 

A  structural  model  is  discontinued  if  predictions  and 
observed  correlations  (variancesr-covariances)  are  inconsistent. 

3.  confirmation  of  predictions  implies  corroborative  support  for 
the  structural  model  represented  by  the  functional  relations 
and  equations.  Disconfirmation  of  predictions  implies  that  one 
or  more  components  of  the  structural  model  (functional  relations 
and  equations)  is  false,  in  which  case  it  is  concluded  that  the 
structural  model  as  proposed  originally  is  invalid. 

The  present  condition  and  Condition  10  focus  on  Principles  1  and  2.  It 
is  shown  that  predictions  regarding  correlations  (variances-  covariances) 
among  manifest  variables,  and  confirmation/disconf irmation  of  these 
predictions,  can  be  addressed  anpirically  by  testing  predictions  regarding 
the  magnitudes  of  estimates  of  structural  parameters.  In  the  present 
condition,  we  consider  tests  of  whether  or  not  structural  parameters  that  are 


predicted  by  the  structural  model  to  be  nonzero  are  different  from  zero.  In 
Condition  10  we  discuss  tests  of  whether  or  not  structural  parameters  that 
are  predicted  by  the  structural  model  to  be  equal  to  zero  are  zero,  or 
approximately  zero.  When  combined,  these  two  sets  of  tests  furnish  the 
information  needed  to  ascertain  whether  a  model  has  been  confirmed  or 
discontinued. 

Principle  3  is  considered  in  Section  3  of  this  paper,  where  we  address 
the  roles  of  empirical  confirmation  and  discontinuation  in  causal  inference. 

The  discussion  of  Condition  9  is  organized  as  follows:  (a)  tests  of 
predictions  regarding  structural  parameters  associated  with  causes,  which 
includes  an  overview  of  identification,  ordinary  least  squares  (OLS ) 
estimation,  and  tests  of  significance  for  a  recursive  model;  (b)  the  use 
of  standardized  versus  unstandardized  manifest  variables,  where  advantages 
and  disadvantages  of  the  path  analytic  approach  to  confirmatory  analysis 
are  reviewed;  and  (c)  a  specification  error  of  major  interest  to 
psychologists,  namely  unmeasured  re.'evant  cases  in  equations  that  are 
pres uned  to  be  self-contained  functional  equations. 

Tests  of  Predictions  Regarding  Structural  Parameters  Associated  with  Causes 

Stated  simply,  inclusion  of  a  variable  as  a  cause  in  a  finct ional 
equation  indicates  that  the  structural  parameter  associated  with  that 
variable  is  hypothesized  to  be  different  from  zero.  It  follows  that  if 
(a)  we  employ  a  statistical  estimating  equation  to  represent  the 
functional  equation,  then  (b)  the  estimates  of  the  structural  parameters 
provided  by  the  statistical  equation  are  predicted  to  be  different  from 
zero.  Given  reasonable  satisfaction  of  Conditions  1  through  8,  the 
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predictions  regarding  estimated  structural  parameters  can  be  tested  in 
random  samples  from  well-defined  (i.e.,  boundaries  specified)  populations 
by  estimating  statistically  the  values  of  structural  parameters  and 
conducting  tests  of  significance  on  the  estimates.  If  the  estimate  of  a 
structural  parameter  is  significantly  different  from  zero,  then  the 
prediction  is  regarded  as  confirmed,  vdiich  implies  that  the  functional 
equation  is  consistent  with  empirical  data  (i.e.,  the  statistical 
estimating  equation) .  If  one  or  more  estimates  is  not  significantly 
different  from  zero,  then  the  prediction(s)  is  discontinued,  implying  that 
the  functional  equation  is  not  consistent  with  the  data. 

The  preceding  points  are  illustrated  using  the  structural  model  shown 
in  Figure  2.7.  Of  initial  importance  is  that  all  (manifest)  variables  in 
Figure  2.7  are  represented  by  an  x.  This  notation  is  adopted  to 
simplify  subscripting  parameters  and  statistics  in  equations.  The  subscripts 


for  the  x/s  denote  causal  order,  the  (i>j_)  are  structural 
parameters,  and  the  d^  are  disturbance  terms.  The  structural  model  is 
"fully  recursive",  which  means  that  (a)  the  model  includes  only  one  exogenous 


variable  (x^),  (b)  the  direction  of  all  causal  connections  is  asymmetric, 
and  (c)  each  event  higher  in  the  causal  order  is  a  function  of  all  events 


lower  in  the  causal  order,  plus  a  disturbance  term  [e.g.,  x^  =  J(x^, 

—2'— 3 '^4^  *  For  iHustrati-ve  purposes,  we  presume  that  empirical  data 

are  to  be  collected  on  a  cross- sectional  basis  on  a  randcm  sample  of  subjects 


from  a  well-defined  population. 


Insert  Figure  2.7  about  here 
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Hie  linear  functional  equations,  in  deviation  form,  are  shovn  in 
Figure  2.7  (Eqs.  2.7a  through  2.7c)  .  In  confirmatory  analysis,  a 
functional  equation  is  often  referred  to  as  a  structural  equation.  This 
suggests  that  the  structure  of  causal  connections  among  variables  is 
represented  by  the  equations.  We  will,  however,  continue  to  use  the  term 
"functional  equation".  Hie  objective  now  is  to  estimate  the  values  of  the 
structural  parameters  in  Bqs.  2.7a  through  2.7c. 

Identification.  Hie  first  step  in  this  process  is  to  ascertain 

whether  each  of  the  functional  equations  is  identified.  As  discussed 

earlier,  this  concerns  the  question  of  whether  sufficient  information  is 

available  to  obtain  unique  mathematical  solutions  of  the  structural 

parameters.  For  recursive,  manifest  variable  models,  a  functional 

equation  will  be  identified  (i.e.,  sufficient  information  is  available)  if 

(a)  the  causes  included  explicitly  in  the  functional  equation  are 

uncorrelated  with  the  disturbance  term  of  that  equation;  and  (b)  all  such 

causes  have  a  unique,  direct  effect  on  the  endogenous  variable  and  are  not 

4 

linearly  dependent  on  other  causes  included  explicitly  in  the  equation. 

We  will  focus  here  on  the  assumption  that  causes  are  uncorrelated  with 
disturbances. 

For  the  purpose  of  describing  identification,  we  will  regard  the 
manifest  variables  in  Eqs.  2.7a  through  2.7c  as  hypothetical  random 
variables  defined  on  a  hypothetical  population.  Now,  in  the  functional 
equation  for  x2  (Eq.  2.7a) ,  it  is  (theoretically)  possible  to  solve  for 
(note,  not  estimate)  the  structural  parameter  directly  by  (a) 
multiplying  through  Bq.  2.7a  by  x^,  (b)  taking  expectations  on  the 
resulting  equation,  and,  if  justified,  (c)  solving  for  algebraically 
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(cf.  Duncan,  1975) .  The  first  two  steps  are  as  follows: 


Functional  equation  for  x.2: 


-2  -21-1  +  -2 


(2.12) 


Multiplying  through  the  equation  by  x^: 


*2*1  -  -21—1-1  +  4A 


Taking  expectations: 


E  (2^2—1  ^  ~~  — 21~  ^  —  (^2-1  ^ 


-21  _  —21—1  +  ^1 


(2.13) 


where  <r_,  and  cr.  .  are  covariances  and  cr,  is  a  variance. 

—21  _2—  — 1 

Equation  2.13  is- often  referred  to  as  a  "normal  equation". 

The  values  assumed  by  the  random  variables  x^  and  x.2'  and  therefore 

2  — 
a;2^  and  ,  may  be  regarded  as  "known"  because  the  random  variables 

can  be  operationalized  as  manifest  variables.  It  is  not  possible  to  regard 

dL,  as  known,  one  obvious  reason  being  that  the  random  shock  component 

of  disturbances,  which  may  be  the  only  component,  is  by  definition 

unmeasurable.  Thus,  d„,  and  therefore  o\  .,  are  considered  "unknowns". 

—2  — 2— 

B21  is  also  unknown;  we  must  solve  for  its  value  based  on  the  known 

2 

values  of  and  Oj—  in  Eq.  2.13.  This  is  not  possible  at  the 

present  time  because  Eq.  2.13  has  two  unknowns  (i.e.,  and  ^ 

—  or,  one  equation  and  two  unknowns)  .  This  is  an  illustration  of 
under  identification,  which  means  that  sufficient  information  is  not 
available  to  obtain  a  unique  mathematical  solution  for  B^.  However,  if 
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we  can  assime  that  x.  is  unrelated  to  d„,  then  <y ,  ,  =  0  and  the 

-I  -2  ^2-i 

equation  is  identified.  That  is,  we  now  have  one  unknown  and  one  equation, 
and  is  easily  solved  for:  B ^  =  o^/c 1  *  ^  six|wn  *n 

Condition  5,  cr^  ^  =  0  if  d2  is  comprised  by  random  shocks 

exclusively,  (or  "by  random  shocks  and  nonrelevant  variables)  . 

This  logic  may  be  generalized  directly  to  the  functional  equations 
for  x^  and  x^  (Eqs.  2.7b  and  2.7c) .  Equation  2.7b  is  identified  if 
cr,  ,  =  or,  0  =  0,  and  EJq .  2.7c  is  identified  if  cr,  ,  =  a,  _ 

=  ffj  3  =  0.  Furthermore,  algebraic  derivation  will  show  that  lack  of 

covariation  between  the  causes  in  a  functional  equation  and  the  disturbance  of 
that  equation  implies  that. the  disturbance  terms  are  themselves  unrelated 
(Duncan,  1975) . 

Lack  of  relationship  among  the  disturbances  of  different  equations 

implies  that  the  system  of  equations  is  identified.  This  is  an  important 

consideration  if  all  structural  parameters  from  all  equations  are  to  be 

estimated  simultaneously.  On  the  other  hand,  if  the  structural  parameters 

for  each  functional  equation  are  to  be  estimated  separately,  then  it  is 

possible  to  estimate  parameters  for  the  identified  equations  but  not  for 

the  underidentified  equations.  Finally,  note  that  no  assumption  is 

required  for  covariations  of  the  form  cr,  where  i  >  j.  For 

— SLjA  — 

example,  x^  may  be  related  to  d2  ^)  without  affecting 
identification.  ~ 

Solutions  and  Estimates  of  Structural  Parameters.  Requirements  for 
identification  vary  as  a  function  of  the  type  of  structural  model  (cf. 

Fisher,  1966) .  We  will  return  to  the  critical  assumption  that  causes 
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included  explicitly  in  a  functional  equation  are  unrelated  to  the 

disturbance  of  that  equation  (and  by  implication  all  disturbances  later  in 

the  causal  order) .  For  the  present,  we  regard  this  assumption  as 

satisfied.  Thus,  functional  equations  2.7a  through  2.7c  are  considered 

identified.  As  shown  above,  this  implies  that  the  solution  for  is 
2 

*  Solutions  for  the  structural  parameters  in  Bqs.  2.7b  and 
2.7c  may  be  determined  in  a  manner  similar  to  that  used  to  solve  for  B^. 
For  example,  it  is  possible  to  solve  for  and  B32  in  Eq.  2.7b 
by  (a)  multiplying  through  Eq.  2.7b  by  x^  and  then  x2,  thus 
providing  two  equations;  and  (b)  taking  expectations  on  each  of  these 
equations.  Hie  resulting  normal  equations  are: 


%1 

-31-1  + 

-32^21 

-32 

=  —31—21  + 

-32^2  2 

(Eq. 

2.14)  and 

(2.14) 

(2.15) 


Therefore,  we  have  two  unknowns  (J3^  and  B^)  and  two  equations,  and 
it  is  possible  to  solve  for  J3^  and  B32  using  determinants.  The 
resulting  solutions  are: 


-31  ~  (-31-2  “  -21-32)/(-l2-2  -21  } 


—32  =  (23A2  "  <^31)/(^1  V  "  ^212) 


(2.16) 

(2.17) 


Inspection  of  Bqs.  2.16  and  2.17  demonstrates  that  the  solutions  for 
and  B^2  are  the  same  as  those  that  would  have  been  obtained  if  we  had 
simply  used  Ordinary  Least  Squares  (OLS)  estimation  to  solve  for  and 
B^2.  That  is,  B^  and  have  the  form  of  unstandardized 
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regression  weights.  The  same  is  true  for  in  Bq.  2.7a; 

2 

—21^—1  *s  t*ie  £°r  unstandardized  regression  weight  in  a 

bivariate  relation.  Finally  if  ^  =  2^,  2  ‘  ^  3  =  then  B^, 

B^'  and  B^  in  Eq.  2.7c  have  the  form  of  un standardized  regression 
weights  and  may  be  solved  by  using  OLS. 

Tb  summarize,  given  a  recursive,  cross-sectional  design,  if  the 
causes  in  a  functional  equation  are  unrelated  to  the  disturbance  of  that 
equation,  then  OLS  may  be  used  to  solve  for  the  values  of  the  structural 
parameters  (note  that  OLS  is  applied  separately  to  each  equation) .  This 
conclusion  was  predicated  on  hypothetical  random  variables  in  a 
hypothetical  population,  but  extends  to  operationalized  designs,  although 
one  must  be  mindful  of  the  fact  that  disturbances  can  not  be  measured 
directly,  lb  illustrate,  the  causes  and  effects  in  functional  equations 
2.7a  through  2.7c  may  be  replaced  with  manifest  variables  defined  on  a 
populaticn(with  boundaries  specified)  if  the  manifest  variables  satisfy 
the  assunptions  discussed  in  Condition  8.  It  is  also  possible  to  employ 
manifest  variables  in  the  algebra  used  to  derive  the  variance  and 
covariances  shown  in  EJqs.  2.13  through  2.17,  including  covariances 
between  causes  and  disturbances  and  solutions  of  the  structural 
parameters.  However,  the  covariances  involving  disturbances  involve 
relations  between  manifest  variables  and  hypothetical  variables  that 
cannot  be  measured  directly.  It  follows,  then,  that  these  covariances  can 
not  be  solved  for,  nor  estimated,  in  operationalized  designs.  In  other 
words,  assunptions  regarding  covariances  between  causes  and  disturbances 
are  a  theoretical  concern.  As  shown  shortly,  an  erroneous  assumption 
that  a  covariance  between  a  manifest  variable  and  a  disturbance  is  zero 
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has  serious  consequences  for  estimates  of  structural  parameters. 

Let  us  continue  the  illustration  with  the  assumptions  that  the 
functional  equations  are  operationalized  in  a  well-defined  population  of 
subjects,  with  manifest  variables  that  satisfy  reasonably  Condition  8  and 
are  unrelated  to  disturbances.  An  OLS  regression  equation  may  now  be 
..iployed  to  represent  a  functional  equation,  where  the  values  of  the 
unstandardized  regression  weights  represent  the  values  of  the  structural 
parameters.  Under  these  assumptions,  the  OLS  error  (residual)  term 
represents  accurately  the  disturbance  term.  However,  this  does  not 
suggest  that  one  may  relate  the  manifest  variables  with  the  OLS  error, 
designated  _e.  ,  to  test  the  assumption  that  the  manifest  variables  in  a 
functional  equation  are  unrelated  to  the  disturbance  of  that  equation.  By 
definition,  an  e.  will  be  unrelated  to  all  manifest  variables  included 
in  an  OLS  equation.  This  point  may  appear  trivial,  but  some  researchers 
have  in  fact  calculated  covariances  (correlations)  between  manifest 
variables  ana  j^'s  to  test  the  assunption  that  cause »  are  unrelated  to 
disturbances.  Still  others  have  defined  a  disturbance  as  a  form  of 
statistical  residual  (e.g.,  an  OLS  error  term) ,  which  is  misleading.  Tb 
reiterate,  a  disturbance  cannot  be  measured  directly,  and  the  covariation 
(correlation)  between  a  disturbance  arvd  manifest  variables  in  a  particular 
equation  cannot  be  solved  for,  or  estimated,  directly.  A  statistical 
residual,  such  as  an  _e^  in  OLS,  represents  (estimates)  a  disturbance 
only  in  the  condition  that  the  manifest  variables  in  an  equation  are 
theoretically  unrelated  to  the  disturbance  of  that  equation. 

Hopefully,  we  have  made  the  point  that  while  a  functional  equation 
involving  manifest  variables  and  an  OLS  equation  involving  manifest 


variables  may  assume  identical  forms,  it  is  only  in  the  functional 
equation  that  covariation  between  manifest  variables  and  a  disturbance  has 
meaning,  and  it  is  only  when  all  such  covariations  are  theoretically 
zero  that  an  OLS  equation  truly  represents  a  functional  equation.  Let  us 
now  proceed  to  the  question  of  estimation  given  that  the  OLS  population 
equation  is  an  accurate  representation  of  the  functional  equation.  This 
is  a  simple  step  because  it  is  identical  to  the  requirements  for 
estimating  population  un standardized  regression  weights  based  on  the 
unstandardized  regression  weights  derived  in  a  sample.  In  general,  the 
unstandardized  regression  weights  derived  in  a  sample  will  be  unbiased 
estimates  of  population  unstandardized  weights,  and  therefore  the 
structural  parameters,  if  the  sample  was  obtained  randomly  from  a 
well-defined  population  in  which  errors  (and  by  implication  disturbances) 
have  a  mean  of  zero  and  a  constant  variance  (i.e.,  homoscedasticity) .  (We 
are,  of  course,  also  assuming  linearity,  interval  scales,  and  essentially 
perfect  reliability)  . 

Let  us  presume  that  the  assumptions  above  have  been  reasonably 
satisfied.  An  illustration  of  the  empirical  estimation  of  structural 
parameters  by  means  of  a  structural  equation  analysis  on  a  recursive  model 
is  then  straightforward.  Consider,  as  an  example,  the  hypothetical  sample 
data  shown  in  Figure  2.8.  Figure  2.8a  is  a  replication  of  the  structural 
model  in  Figure  2.7,  only  here  we  have  placed  variances  of  the  variables 
in  parantheses  below  the  variables  and  covariances  in  parentheses  above 
the  arrows  connecting  the  variables.  The  sample  OLS  regression  equations 
(Eqs.  2.8a  through  2.8c)  in  Figure  2.8b  were  used  to  estimate  the  values 
of  the  structural  parameters  in  the  structural  equations  in  Figure  2.7. 
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That  is,  the  instandard ized  regression  weights  in  Bqs.  2.8a  through  2.8c 
(i.e.,  the  j3„)  were  used  as  estimates  of  the  in  Bqs.  2.7a 
through  2.7b.  A  separate  OLS  analysis  was  conducted  for  each  of  the  OLS 
regression  equations.  The  results  of  the  analyses  are  shown  in  Figure 
2.8c.  Given  reasonable  satisfaction  of  Conditions  1  through  8  and  the 

A 

assunptions  for  OLS  analysis,  the  are  unbiased  estimates  of  the 

-IX 


Insert  Figure  2.8  about  here 


Given  unbiased  estimates,  we  are  now  in  a  position  to  confirm  or 
disconfirm  the  predictions  that  estimates  of  structural  parameters 
associated  with  causes  in  functional  equations  should  be  significantly 
different  from  zero.  This  is  accomplished  by  employing  the  conventional 
significance  test  for  instandard  ized  regression  weights.  [This  test 
requires  the  additional  assumption  that  the  population  OLS  error  term  (and 
the  disturbance)  is  distributed  normally.]  If  all  estimated  structural 
parameters  for  a  particular  functional  equation  are  significant,  then  all 
predictions  are  confirmed  in  regard  to  our  first  test  of  confirmation.  If 
all  predictions  in  all  equations  are  confirmed,  then  the  structural  model 
is  regarded  as  being  consistent  with  the  data,  again  in  regard  to  the 
first  test  of  confirmation.  For  example,  given  a  large  sample,  the 
B.^  in  Figure  2.8c  would  be  significant,  which  suggests  that  the 
structural  model  in  Figure  2.7  is  confirmed  in  regard  to  the  first  test  of 
confirmation.  However,  if  one  or  more  estimated  structural  parameters  is 
not  significant,  then  not  only  are  the  predictions  associated  with  those 
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estimates  discontinued,  but  the  structural  model  is  regarded  as 
inconsistent  with  the  data. 

In  sum,  simple  multiple  regression  may  be  used  to  test  predictions 
that  estimated  structural  parameters  (unstandardized  regression  weights) 
associated  with  causes  are  significant.  If  the  estimated  parameters  are 
significant  in  each  and  every  statistical  (OLS)  equation,  then,  based  on 
the  first  test  of  confirmation/disconfirmation,  a  structural  model  is 
confirmed.  Confirmation  implies  that  the  structural  model  is  consistent 
with  the  data.  If  one  or  more  estimated  structural  parameters  is  not 
significant,  then  the  model  is  discontinued,  implying  that  the  model  is 
not  consistent  with  the  data.  Now,  given  the  consensus  that  confirmatory 
analysis  should  be  conducted  on  large  samples,  it  follows  that  the 
significance  tests  for  estimated  structural  parameters  are  powerful. 
Consequently,  unstandardized  regression  weights  of  rather  trivial 
magnitudes  are  likely  to  be  significant.  This  suggests  that  the  present 
test  is  not  likely  to  disconfirm  many  predictions,  or,  it  is  weighted  in 
favor  of  confirming  predictions.  As  we  shall  see  in  Condition  10, 
precisely  the  opposite  problem  occurs  in  our  second  test  of 
confirmatiorv'disconfirmation. 

Theory  trimming.  It  is,  of  course,  possible  for  instandardized 
regression  weights  to  be  nonsignificant,  even  with  large  samples.  How 
serious  is  this  source  of  discontinuation?  The  answer  to  this  question 
depends  on  the  model  and  the  salience  of  the  causal  hypothesis  underlying 
the  structural  parameter.  For  example,  in  Figures  2.7  and  2.8,  it  may  be 

critical  to  the  theory  underlying  the  model  that  the  estimate  of 

^  ...  .  .  .  ^ 
namely  B^#  be  significant.  Consequently,  a  nonsignificant  B^, 


or  a  significant  of  trivial  magnitude,  may  be  a  serious  blow.  On 
the  other  hand,  a  nonsignificant,  or  significant  but  low,  may  not 
be  a  serious  theoretical  blow.  Thus,  the  particular  source  of 
disconfirmation  of  predictions  (i.e.,  the  causal  hypotheses  involved)  may 
vary  from  rather  minor  to  drastic.  If  confirmation  of  a  prediction  is  of 
particular  importance  to  a  model,  and  the  prediction  is  disconfirmed,  then 
one  may  decide  that  there  is  little  reason  to  proceed  with  additional 
analyses.  We  recommend  that  they  do  proceed  in  order  to  test  fully  all  of 
the  predictions  of  the  model. 

If  the  source  of  disconfirmation  is  not  of  major  importance,  then  the 
investigator  may  consider  "theory  trimming".  Theory  trimming  consists  of 
deleting  causal  connections  from  a  structural  model  (Heise,  1969),  which 
is  to  say  deleting  a  cause  and  its  associated  structural  parameter  from  a 
functional  equation.  It  is  extremely  important  to  note  that  theory 
trimming  is  a  form  of  exploratory,  and  not  confirmatory,  analysis.  That 
is,  the  investigator  is  now  making  decisions  based  on  data,  and  is  no 
longer  confirming/disconfirming  a  priori  theory.  The  crucial  implication 
of  this  point  is  that  the  structural  parameters  for  which  trimming  might 
be  indicated  in  this  first  test  of  confirmation/disconfirmation  should 
never  be  used  to  test  confirmation/disconfirmation  in  the  second  test 
(Gondition  10).  The  reasoning  is  simple;  the  second  test  of 
confirmatioa/disconfirmation  assesses  whether  estimated  structural 
parameters  that  are  predicted  to  be  zero  are  in  fact  not  significantly 
different  than  zero.  If,  after  looking  at  the  data,  one  knows  that  an 
estimated  parameter  is  essentially  zero,  then  a  test  that  this  same 
paraneter  is  predicted  to  be  zero  is  nonsense. 
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In  conclusion,  theory  trimming  of  minor  sources  of  disconf innation  is 
acceptable  as  long  as  one  remembers  that,  following  trimming,  one  is  no 
longer  dealing  with  an  a  priori,  theoretical  model.  This  leaves  two 
avenues  for  further  analysis,  specifically  (1)  test  the  trimmed  model  in  a 
new  sample,  or  (2)  do  not  trim  prior  to  conducting  the  second  test  of 
confirmatiorVdisconfirmation  discussed  in  Oondition  10.  One  may  theory 
trim  after  all  analyses  are  conducted  and  interpreted,  and  the  researcher 
is  proposing  a  revised  theoretical  model  for  future  research. 

Standardized  versus  un standardized  manifest  variables 

Up  to  now  we  have  treated  manifest  variables  in  deviation  form,  one 
result  being  that  the  OLS  estimates  of  structural  parameters  are 
unstandardized  regression  weights.  It  is  also  possible  to  treat  the 
manifest  variables  in  standardized  form,  in  which  case  the  values  on  the 
variables  are  standard  scores  (i.e.,  z^  =  (X-p)/o)  ,  and  the  variables 
have  a  mean  of  zero  and  a  standard  deviation  of  1.0.  When  the  variables 
are  in  standardized  form,  the  structural  parameters  are  referred  to  as 
standardized  structural  parameters,  or  path  coefficients.  The  estimates 
of  path  coefficients  provided  by  OLS  are  "standardized  regression 
weights",  or  beta  weights.  Moreover,  it  is  customary  to  refer  to  the 
structural  model  as  a  path  model ,  the  functional  equations  as  path 
equations,  and  the  confirmatory  analysis  as  a  path  analysis.  A  full 
recursive  path  model  and  its  corresponding  set  of  path  equations  are  shown 
in  figure  2.9,  where  ^s  are  used  to  designate  standard  scores,  and 
(i>i)  designate  path  coefficients.  It  is  customary  to  employ  the 
symbol  "_u^"  to  designate  disturbance  terms  in  path  models  and  path 
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equations. 

Insert  Figure  2.9  about  here 

There  are  advantages  and  disadvantages  associated  with  both  path 
coefficients  and  (unstandardized)  structural  parameters  (cf.  Uikey,  1964; 
Wright,  1960)  .  The  advantages  of  path  coefficients  are  as  follows: 

(1)  Algebraic  and  statistical  manipulations  are  simplified. 

(2)  Path  coefficients  are  based  on  readily  interpretable 
correlation  coefficients. 

(3)  Path  coefficients  themselves  are  readily  interpretable 
inasmuch  as  all  variables  are  based  on  the  same  metric. 

(4)  Path  analysis  provides  simplified  expressions  for 
decomposing  correlations  into  functions  of  path  coefficients.5 

Structural  parameters  typically  have  none  of  the  advantages  above 
because  the  variables  are  based  on  different  metrics  (see  Figure  2.8)  and 
variance-covariance  matrices  replace  correlation  matrices.  However, 
structural  parameters  have  the  advantage  that  they  can  be  compared  directly 
across  different  populations  (for  the  same  variables)  or  for  the  same 
population  over  time.  We  have  chosen  to  illustrate  the  former  case  because 
we  wish  to  demonstrate  how  the  subgrouping  moderator  approach  can  be 
applied  in  confirmatory  analysis.  By  doing  so,  our  intention  is  to 
discourage  carrying  over  into  confirmatory  analysis  the  practice  among  some 
investigators  of  conducting  subgrouping  moderator  analysis  on  standardized 
regression  weights  (beta-weights,  correlation  coefficients  in  the  bivariate 


case) 


♦ 
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Consider  a  linear  bivariate  model  in  which  both  variables  and 
x^)  have  been  measured  in  each  of  tw  specified  populations  (e.g., 
males  and  females)  .  The  path  equation  in  each  population  is  = 
jo^z^  +  the  corresponding  structural  equation  in  each  population 
is  ^2  =  —  2  +  —21—1  +  —1*  strLCtural  equation  is  presented 

in  raw  score  form,  where  A2  is  the  intercept.  The  structural  parameter 
B2 j  may  be  thought  of  as  the  slope  of  a  regression  line,  where  the 
slope  reflects  the  amount  of  change  in  X 2,  in  units  of  X2,  brought 
about  by  a  unit  of  change  in  X^.  This  suggests  that  the  structural 
parameter  reflects  the  concrete  contribution  that  X^  makes  directly 
to  X2  in  X2  units-  if  the  same  metrics  are  employed  across  the  two 
populations,  then  the  contribution  that  X^  makes  to  X2  can  be 
compared  across  the  populations  in  an  absolute  sense;  that  is,  in  terms 
of  units  of 

The  path  coefficient  p21  may  also  be  thought  of  as  the  slope  of  a 
regression  line.  However,  the  path  coefficient  is  an  abstract  measure 
of  the  slope  because  it  is  based  on  an  abstract  scale  that  varies  as  a 
function  of  a  ratio  of  standard  deviation  units  within  a  particular 
population.  That  is,  £21  =  M.21  ^ax /ax  )»  which  means  that 

path  coefficients  are  unstandardized  regression  weights  adjusted  by  a 
ratio  of  the  standard  deviations  in  each  population.  Thus,  a  comparison 
of  path  coefficients  is  in  actuality  a  comparison  of  abstract  scales,  each 
adjusted  for  population  idiosyncrasies  in  standard  deviations.  This 
suggests  that  while  the  concrete  contribution  (causal  effect)  indicated  by 
B^  may  remain  invariant  across  populations,  the  path  coefficients  may 
vary  simply  because  the  standard  deviations  of  X^  and/or  X2  vary 
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across  populations.  Thus,  the  general  rule  is  that  path  coefficients 
should  not  be  employed  in  the  comparison  of  causal  effects  for  different 
populations,  or  in  the  comparison  of  causal  effects  in  the  same  population 
over  time  (cf.  Blalock,  1967,  1968,  1969;  Spaeth,  1975;  Tukey,  1964; 

Wiley  &  Wiley,  1971)  . 

On  the  other  hand,  path  analysis  has  the  advantages  discussed  earlier. 
V*ien  research  is  focused  on  a  specific  population  using  cross-sectional 
data  collected  during  an  equilibr inn- type  condition,  the  advantages  may 
outweigh  the  disadvantages  (Wright,  1960).  For  our  purposes,  the 
opportunity  to  simplify  statistical  derivations  is  important,  and  thus  a 
path  analysis  paradigm  is  employed  frequently  in  the  remaining  discussion. 
Specification  Error  Due  to  an  Uhmeasured  Relevant  Cause 

Specification  error  is  a  general  term  that  refers  to  errors 
(misspecifications)  in  the  form  of  a  structural  model  and  its  accompanying 
functional  equations,  or  in  the  operationalization  of  variables.  In  this 
discussion  we  will  demonstrate  a  specification  error  of  particular  concern 
to  psychologists,  namely  unmeasured  relevant  causes.  We  will  also  return 
to  the  question  of  relations  between  causes  and  disturbances,  the  resulting 
under identification,  and  the  consequences  of  proceeding  with  OLS  estimation 
given  violation  of  conditions  for  confirmatory  analysis. 

In  the  presentation  of  Condition  5,  we  demonstrated  that  an  unmeasured 
relevant  cause  would  result  in  a  theoretical  covariation  between  causes 
included  explicitly  in  a  functional  equation  and  the  disturbance  term  of 
that  equation  (see  Figure  2.4  and  Bq.  2.11).  We  also  demonstrated  that 
covariation  between  a  measured  cause  and  a  disturbance  implies  that  a 
functional  equation  is  not  self-contained.  Finally,  we  noted  that 
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psycho logists  have  paid  scant  attention  to  the  need  for  self-contained 
functional  equations,  although  increased  attention  is  not  likely  t.o 
solve  this  problem  completely  because  all  relevant  causes  of  an  effect  are 
unlikely  to  be  known.  Nevertheless,  it  is  possible  to  reduce  substantially 
the  influence  of  unmeasured  relevant  causes,  or  what  we  will  refer  to  as 
the  "unmeasured  variables  problan",  by  (a)  recognizing  that  the  problem 
exists,  (b)  judging  whether  sufficient  information  has  accumulated  to 
justify  a  confirmatory  analysis,  and  (c)  including  all  known  relevant 
causes  in  the  appropriate  functional  equations.  Given  (a)  and  assuming  an 
affirmative  answer  to  (b) ,  the  operational  question  is  (c) ,  namely  have 
known  relevant  causes  been  omitted  from  one  or  more  functional  equations. 

As  we  shall  see,  this  question  consists  of  postulating  the  degree  to  which 
an  unmeasured  variables  problem  is  believed  to  lead  to  bias  in  estimates  of 
structural  parameters. 

Within  psychology,  a  primary  cause  of  an  unmeasured  variables  problem 
appears  to  be  the  failure  to  recognize  that  "focused"  theoretical  models 
are  not  amenable  to  confirmatory  analysis,  lb  illustrate,  it  is  frequently 
the  case  that  psychological  models  are  designed  around  sets  of  focused 
causal  variables,  where  by  "focused"  we  mean  causes  that  are  of  interest  to 
an  investigate r(s) .  Leader  behaviors  are  one  example.  In  contrast,  the 
endogenous  variables  are  typically  less  focused,  and  include  such  things  as 
attrition,  overall  satisfaction,  and  performance.  Che  of  the  certain  ways 
to  create  a  serious  unmeasured  variables  problem  is  to  analyze  focused 
causal  variables,  such  as  leader  behaviors,  in  relation  to  a  g lobal 
endogenous  variable,  such  as  overall  job  satisfaction.  The  latter  includes 
a  plethora  of  additional  causal  influences,  such  as  pay,  working 
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conditions,  opportunities  for  advancement,  job  stability,  and  so  forth. 

By  way  of  simple  illustration.  Figure  2.10a  displays  a  self-contained 
path  model  in  which  the  manifest  exogenous  variable  z^  is  a  cause  of 
the  two  endogenous  variables,  an<3  £3*  ,z2  *s  a^so  a  cause  of  Zy 

The  _uk  (u^  and  u^)  are  disturbance  terms,  and  are  assumed  to  be 
comprised  exclusively  by  random  shocks  (RS) .  (It  is  assumed  that,  with  the 
exception  of  an  unmeasured  variables  problem  in  Figure  2.10b,  all  conditions 
for  confirmatory  analysis  have  been  satisfied.) .  Population  path  equations, 
expectations,  and  normal  equations  for  Model  2.10a  are  shown  in  the  upper 
portion  of  Figure  2.11.  Note  that  the  functional  equations  are  identified; 
the  normal  equation  for  z^  (Eq.  2.11a)  has  one  equation  and  one  unknown 
(i.e.,  p21  =  r_2])  /  and  the  normal  equations  for  (Eqs.  2.11b  and 
2.11c)  have  two  unknowns  in  two  equations.  The  correlations  are  assumed  to 
be  known,  and  are  indicated  by  "r .  .  [These  are  population  correlations; 

the  conventional  £  is  not  used  because  it  is  easy  to  confuse  p 
(population  correlation)  with  £  (path  coefficient)  .] 


Insert  Figures  2.10  and  2.11  about  here 


Figure  2.10b  illustrates  a  condition  in  which  the  u^  are  not 
comprised  by  RS  components  exclusively.  Rather,  an  unmeasured  causal 
variable,  a,  is  present  in  both  terms.  The  variable  a  is 

regarded  as  a  relevant  cause  of  both  z ^  and  Zy  which  includes 
relationships  with  _z^  (indicated  by  the  curved,  double-headed  arrows 
from  a  to  z^) .  Moreover,  because  £  appears  in  both  disturbance 
terms  (i.e.,  the  curved  arrow  between  a  for  z 2  and  a  for  z^)  ,  the 
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disturbances  are  also  correlated.  (The  a' s  need  not  be  the  same  for  this 
to  occur;  different  but  correlated  a' s  for  z^  and  also  lead  to 
correlated  disturbances.) 

Path  equations,  expectations,  and  normal  equations  for  Figure  2.10b 

are  showi  in  the  lower  portion  of  Figure  2.11.  As  showi  in  the  normal 

equation  for  z 2  (Eq.  2. lid)  and  the  normal  equations  for  z ^  (Eqs. 

2. lie  and  2. Ilf)  ,  the  equations  are  under  identified  because  the  unmeasured 

a  covaries  with  both  z ^  and  z ^  (i.e.,  Eq.  2. lid  has  two 

unknowns,  and  Eqs.  2. lie  and  2. Ilf  contain  four  unknowns — the  relations 

between  a  and  the  z' s  are  represented  as  covariances  because  if  ch  = 

RS  +  a.,  and  if  d.  is  in  standardized  form,  then  a.  cannot  be  in 
—  ~±  —±  -i 

standardized  form) .  Consequently,  OLS  can  not  be  used  to  solve  for  the 
path  coefficients  in  the  population  or  to  estimate  the  coefficients  in  a 
sample. 

Now,  consider  the  case  that  the  path  model  displayed  in  Figure  2.10b 
is  operable,  but  that  an  investigator  assumes  that  u 2  and  u^  are 
comprised  by  RS  components  exclusively.  It  is  possible  to  use  OLS  to 
solve  for  the  path  coefficients;  however,  the  path  coefficients  will  be 
biased.  Tb  illustrate  the  bias,  a  false  model  (a  is  not  included  in  the 
path  equation)  is  compared  to  a  true  model  (_a  is  included  in  the  path 
equation)  in  order  to  determine  the  consequences  of  employing  the  false 
model  to  solve  for  the  path  coefficient(s)  (Duncan,  1975;  James,  1980).  A 
comparison  of  the  false  and  true  models  for  the  z 2  equation  is 
presented  in  Figure  2.12.  The  "true"  path  coefficients  are  designated  by 
primes  (e.g.,  p'^  and  jo^)  . 
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Insert  Figure  2.12  about  here 


As  shown  in  Eq.  2.12a,  i-n  the  false  model  differs  from  in 
the  true  model  by  a  factor  of  P^a^al*  lmPlies  that  £2l  he 

biased  if  both  and  _r^  are  greater  than  zero.  In  other  words,  if  the 
unmeasured  a  is  (a)  a  cause  of  _z2  and  (b)  correlated  with  z^,  then 
£21  will  be  biased.  If  we  disregard  suppressors,  then  the  bias  will  result 
in  a  that  is  too  large;  this  is  a  direct  result  of  failure  to  control  for 
the  effects  of  a  in  solving  for  in  the  false  model.  The  ramification 
of  this  bias  is  that  causal  influence  that  rightfully  belongs  to  a  is 
instead  attributed  to  z.. 

Let  us  now  address  the  question  of  the  degree  to  which  p2i  may  be 
biased.  If  either  p^  or  is  zero,  or  approximately  zero,  then  little 
or  no  bias  will  exist  in  ja^.  This  suggests  that  bias  will  not  occur  if  an 
unmeasured  variable  is  in  fact  a  cause  of  the  endogenous  variable  but  is 
unrelated  to  the  measured  causes  of  that  same  variable.  (Note  that  the 
unmeasured  cause  is  not  a  relevant  cause  if  it  is  'unrelated  to  measured 
causes.)  Consequently,  one  need  not  assume  that  all  major  causes  of  an 
endogenous  variable  have  been  measured.  Rather,  an  unmeasured  cause  must 
also  be  related  to  a  measured  cause  before  bias  will  ensue. 

We  must  also  entertain  the  fact  that  there  are  degrees  of  causation; 

0 

the  magnitude  of  might  be  anywhere  on  a  continuum  from  low,  to 
moderate,  to  high.  Similarly,  the  magnitude  of  r  may  vary  from  zero, 
or  approximately  so,  to  low,  moderate,  or  high.  Ihus,  the  product  term 
p^r^l  can  assume  many  values,  only  some  of  which  result  in  serious 
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bias  of  £21*  From  a  pragmatic  standpoint,  we  shall  assume  that  the 
permutations  most  likely  to  lead  to  serious  bias  are  high-high, 
moderate-high,  high-moderate,  and  moderate-moderate.  Moreover,  an 
unmeasured  variable  will  not  result  in  bias  if  this  variable  is  highly 
correlated  with  a  measured  cause.  This  point  is  easily  demonstrated  in  the 
"true"  equation  for  z^,  which  is  presented  below  with  appropriate 
controls  indicated  for  the  path  coefficients. 

—2  =  -21-a^l  +  P|a-1^  +  ^2 

The  path  coefficient  for  a  is  theoretically  equal  to,  or 
approximately  equal  to,  zero  if  a  and  z ^  are  correlated  highly  (e.g., 

.95) .  Consequently,  there  is  no  reason  to  include  a  in  the  z 2  equation 
because  it  is  essentially  redundant  with,  or  linearly  dependent  on,  z  . 

Note  also  that  inclusion  of  a  in  the  z 2  equation  would  result  in  a 
multicol linearity  problen  if  OLS  were  used  to  solve  for  the  path  coefficients 
Thus,  with  a  unmeasured,  essentially  no  bias  will  ensue  for  the  p21  path 
coefficient  (i.e.,  =  0  k6031-156  P^a-i  =  °)  •  Here  again,  a 

would  not  be  an  unmeasured  relevant  cause  if  it  is  linearly  dependent  on 
the  causes  already  included  in  an  equation. 

In  sum,  an  unmeasured  variable  must  be  a  relevant  cause  before  it  will 
lead  to  bias  in  the  solutions  (estimates)  of  path  coefficients  for  measured 
causes.  In  particular,  it  must  have  at  least  a  moderate  effect  on  the 
endogenous  variable  after  controls  have  been  effected  for  measured  causes, 
and  it  must  have  at  least  a  moderate  correlation  with  one  or  more  measured 
causes  without  being  linearly  dependent  on  the  measured  causes.  Ttie 
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implication  of  these  points  is  that  if  an  unmeasured  causal  variable  is  not 
a  relevant  cause,  then  one  does  not  have  a  serious  unmeasured  variables 
problem. 


If,  however,  the  unmeasured  variable  is  a  relevant  cause,  then  as 
shovn  in  Figure  2.12,  bias  will  ensue  in  the  solution  (estimate)  of  a  path 
coefficient(s) .  This  point  is  illustrated  empirically  in  Figure  2.13. 

Figure  2.13a  is  a  self-contained  or  true  model  with  relevant  causes  and 
a  (in  standardized  form)  in  the  path  equation  for  z^.  Hypothetical 
population  correlations  associated  with  the  relation  between  the  two 
exogenous  variables  (i.e.,  r^  =  .50)  and  the  causal  connections  are 
presented  in  parentheses.  Solutions  for  the  path  coefficients  in  the  true 
model,  designated  by  a  prime,  were  obtained  by  OLS;  the  path  coefficients  (beta 
weights)  were:  =  .50  and  =  .40.  A  false  model  with  a 

unmeasured  is  shovn  in  Figure  2.13b.  The  OLS  solution  for  p21  is  .70,  or 
simply  £21*  The  bias  in  p2^  resulting  from  the  failure  to  include  a 
in  the  path  equation  for  £2  *s  shovn  in  Figure  2.13c.  The  bias  is  equal  to 
£2l  -  £21  =  *70  ~  *50  =  '20*  This  is  equivalent  to  ~ 

(.40) (.50)  =  .20. 


Insert  Figure  2.13  about  here 


The  discussion  above  transfers  directly  to  more  complex  equations 
involving  multiple  causes,  although,  given  an  unmeasured  relevant  cause, 
the  direction  of  bias  may  be  either  positive  or  negative.  Nevertheless, 
the  unmeasured  cause  must  be  a  relevant  cause  before  serious  bias  will 
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ensue  in  the  solutions  (estimates)  of  path  coefficients.  We  will  forego  a 
statistical  demonstration  of  this  point.  Rather,  we  shall  proceed  to 
surma rize  the  salient  points  from  the  preceding  discussion  in  the  form  of 
"decision  steps".  The  decision  steps,  presented  by  James  (1980),  are 
designed  to  assist  investigators  in  making  a  judgment  of  whether  an 
unmeasured  variables  problem  is  of  sufficient  seriousness  to  preclude  the 
use  of  path  analysis  (or  other  forms  of  confirmatory  analysis)  .  The 
decision  steps  are  prefaced  on  the  logic  that  while  it  is  unrealistic  to 
expect  obviation  of  the  unmeasured  variables  problem  in  research,  it  is 
possible  to  attempt  to  minimize  bias  in  path  coefficients  (structural 
parameters)  to  the  point  that  the  bias  is  within  "tolerable  limits"  for 
research  purposes.  It  is  also  prefaced  on  the  logic  that  investigators 
will  exercise  "good  judgment"  in  deciding  initially  that  enough  information 
has  accumulated  to  justify  confirmatory  analysis,  which  is  a  pragmatic  but 
defensible  reason  for  focusing  on  known  causes.  (We  address  this  issue 
further  in  Section  3.) 

The  decision  steps  are  presented  in  Figure  2.14.  The  steps  are 
written  from  the  standpoint  of  designing  a  cross-sectional,  manifest 
variable  analysis  involving  recursive  causal  connections,  although  the 
rationale  generally  transfers  to  other  types  of  designs.  Furthermore,  the 
steps  apply  to  one  endogenous  variable,  and  should  be  applied  to  each 
endogenous  variable  in  a  structural  model. 


Insert  Figure  2.14  about  here 


* 

The  decision  steps,  while  subjective,  are  the  most  help  that  we  can 


give  reserchers  who  are  using  cross-sectional  designs  with  manifest  (or 
latent)  variables.  However,  with  longitudinal  designs,  it  is  possible  to 
minimize  statistically  the  bias  in  estimates  of  structural  parameters 
resulting  from  unmeasured  relevant  causes.  Of  particular  importance  is 
time  series  analysis  combined  with  generalized  differencing  and  generalized 
least  squares  (cf.  James  &  Singh,  1978;  Johnston,  1972;  Ostrom,  1978; 
pindyck  &  Rubinfeld,  1976) .  The  logic  of  these  methods  is  that  if  a 
disturbance  term  is  estimated  for  the  same  endogenous  variable (s)  at  each 
of  several  times  of  measurement,  it  is  possible  to  estimate  relationships 
among  the  disturbance  terms.  If  the  disturbance  terms  are  comprised  by 
randan  shocks  exclusively,  then  these  relationships  will  be  zero,  or 
approximately  so.  If,  however,  unmeasured  causes  (relevant  and  nonre levant) 
are  contained  in  the  estimated  disturbance  terms,  then  it  follows  that  the 
unmeasured  causes  will  correlate  with  themselves  over  time  because  they 
are,  by  definition,  stable.  These  correlations  are  referred  to  as 
autocorrelations,  or  serial  correlations,  among  the  disturbance  terms. 
Statistical  procedures  can  be  used  to  remove  the  serial  correlation  from 
the  estimated  disturbance  terms,  and,  in  the  process,  obviate  relationships 
between  measured  causes  and  disturbances.  The  result  is  that  consistent 
estimates  of  structural  parameters  may  be  obtained  even  though  one  may  have 
a  serious  unmeasured  variables  problem. 

Summary.  Considerable  territory  has  been  covered  in  presenting 
Condition  9,  and  there  is  more  to  come  in  the  discussion  of  latent 
variables.  Nevertheless,  with  a  minimum  of  statistical  development  we  have 
attempted  to  make  three  points.  First,  predictions  regarding  estimates  of 
structural  parameters  (standardized  or  unstandardized)  must  be  confirmed  if 
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a  structural  model  is  to  be  consistent  with  empirical  data,  This  is  the 
first  test  of  confirmation/d isconfirmation.  Second,  erroneous  conclusions 
may  be  drawn  regarding  confirmation/disconfirmation  of  predictions  if  the 
estimates  of  structural  parameters  are  biased.  Third,  the  degree  of  bias 
in  estimates  of  structural  parameters  is  a  function  of  specification  errors 
brought  about  by  the  failure  to  satisfy  reasonably  one  or  more  of  the 
preceding  eight  conditions.  An  illustration  was  provided  for  unmeasured 
relevant  causes  (a  violation  of  the  self-containment  condition) .  An 
illustration  of  the  effects  of  randcm  measurement  error  in  manifest 
variables  is  presented  in  Section  4.  Illustrations  of  the  effects  of 
violations  of  other  conditions,  such  as  specification  errors  in  causal 
order,  causal  direction,  causal  interval,  and  operationalization  of 
variables,  are  furnished  in  the  confirmatory  literature  (cf.  Billings  & 
Wroten,  1978;  Bohrnstedt  &  Carter,  1971;  Cook  &  Campbell,  1976;  Darlington 
&  Rom,  1972;  Deegan,  1974;  Goldberger  &  Duncan,  1973;  Griffin,  1977;  Heise, 
1969;  Werts  &  Linn,  1971;  Young,  1977;  Cliff,  1980). 


The  estimation  of  structural  parameters  in  Condition  9  is  based 
(conditional)  on  the  assumption  that  the  structural  model,  as  proposed 
originally,  is  valid.  It  is,  however,  quite  possible  for  all  of  the 
estimated  parameters  to  be  significantly  different  fran  zero  when  the 
structural  model  is  invalid.  Tb  illustrate,  consider  the  two  path  models 
and  their  accanpanying  path  equations  in  Figure  2.15.  Model  2.15a  predicts 
that  similarity  of  interests  (z^)  leads  to  felt  attraction  (z2) ,  which 
in  turn  leads  to  a  higher  probability  to  form  a  friendship  (z^) .  This  is  a 
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simple  chain  model  in  vrfiich  a  direct  effect  is  not  hypothesized. 

Rather,  the  effect  of  z^  on  is  indirect,  which  is  to  say  that  the 
effect  of  similarity  of  interests  on  friendship  is  mediated  by  the 
intervening  process  of  experienced  attraction.  Model  2.15b  predicts  that 


z^  affects  both  z^ 

and  Zj 

directly, 

where 

both  z^  and  z2 

are  relevant  causes 

of 

(z^  also 

affects 

z 2  indirectly  through 

z^) .  If  Model  2.15a  were  tested  in  Condition  9,  vdien  in  fact  Model  2.15b 
is  the  true  model,  then  the  estimate  of  the  path  coefficient  for  z2  in  Bq. 

2.15b  (i.e.,  would  be  biased.  This  would  occur  even  though  estimates 

of  in  Bq.  2.15a  and  £^2  *n  2.15b  were  significant.  The 

rationale  here  is  that  failure  to  include  a  measured  relevant  cause  in  a 
functional  equation  (i.e.,  z^  in  Bq.  2.15b)  has  the  same  implications  as 
failure  to  include  an  unmeasured  relevant  cause  (i.e.,  lack  of  self-  containment 
and  biased  parameter  estimates) .  However,  unlike  unmeasured  relevant  causes, 
it  is  possible  to  test  empirically  for  direct  causal  effects  for  measured 
causes.  This  is  a  form  of  goodness  of  fit  test,  and  brings  us  to  the  second 
test  of  confirmation/disconfirmation. 


Insert  Figure  2.15  about  here 


The  key  condition  required  for  the  second  test  of  confirmation/ 
disconfirmation  is  that  no  direct  causal  connection  is  hypothesized 
between  at  least  one  causal  variable  and  an  endogenous  variable  in  the 
original  structural  model.  This  is  analogous  to  hypothesizing  that  the 
value  of  the  structural  parameter  relating  these  two  variables  is  zero. 
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For  example,  in  Figure  2.15a,  there  is  no  direct  arrow  between  and 
z^;  this  means  that  p31  *s  hypothesized  to  be  equal  to  zero.  Thus,  the 
path  equation  for  in  Figure  2.15a  (Eq.  2.15b)  may  be  thought  of  as 

—3  =  (£31  +  £32^2  +  -3  =  £32^-2  +  -3*  Jt  cannot  be 

emphasized  strongly  enough  that  setting  structural  parameters  equal  to  zero 

must  be  based  on  theoretical  grounds  (_i .e . ,  accompanied  by  a  theoretical 

rationale)  and  proposed  as  part  of  the  original  structural  model . 

lb  continue  the  illustration,  the  hypothesis  that  p31  =  0  in  Figure 

2.15a  and  Eq.  2.15b  leads  directly  to  the  prediction  that  the  OLS  estimate  of 

£3^  should  not  differ  significantly  from  zero.  This  hypothesis  can  be 

tested  by  estimating  j>31  in  Bq.  2.15d,  which  is  a  test  of  the  goodness  of 

fit  of  model  2.15a.  If  the  estimate  of  jo^  is  significant,  then  model 

2.15a  is  disconfirmed  and  model  2.15b  is  a  viable  (but  intestable,  as 

explained  below)  possibility.  This  form  of  test  is  referred  to  as  the 

omitted  paraneter  test  of  goodness  of  fit,  and  is  described  later  in 

this  section.  A  test  that  accomplishes  essentially  the  same  purpose,  but 

in  a  more  subtle  manner,  is  to  demonstrate  that  the  correlation  between  z^ 

and  z^  is  equal  to  the  product  of  the  paths  that  link,  indirectly,  z^ 

to  Zy  or  £31  *  P2l£32'  in  Fi(3ure  2.15a.  This  is  referred  to  as 

the  reproduced  correlation  test.  The  reproduced  correlation  test,  or 

reproduced  variance-covariance  test  if  variables  are  not  standardized,  is 

prominent  in  theoretical  discussions  of  confirmatory  analysis,  and  is 

described  prior  to  the  omitted  parameter  test.  It  is  noteworthy  that  the 

reproduced  correlation  (variance-covariance)  test  is  not  often  used  in 

manifest  variable  designs  because  significance  tests  are  not  available  if 

OLS  is  used  to  estimate  structural  parameters  (Kenny,  1979) .  Ch  the  other 


hand,  it  is,  at  the  present  time,  the  method  of  choice  in  latent  variable 
analysis,  where  significance  tests  do  exist. 


The  objectives  of  this  section  are  (a)  to  describe  the  reproduced 
correlation  test  and  the  omitted  parameter  test,  (b)  to  demonstrate  the 
relations  between  the  tests,  and  (c)  to  combine  the  two  tests  of 
confirmatiorv'disconfirmation  (i.e..  Condition  9  and  Condition  10).  Path 
models  based  on  a  cross-sectional  design  are  used  to  illustrate  the 
process.  Initial  derivations  are  based  on  population  data,  although  the 
processes  are  the  same  for  the  sample  data.  We  assume  that  Conditions  1 
through  8  and  the  assumptions  required  for  OLS  have  been  resonably 
satisfied. 

Reproduced  Correlation  Test 

We  begin  with  the  path  model  shown  in  Figure  2.16  (which  is  the  same 

as  Figure  2.9) .  The  key  to  the  reproduced  correlation  test  is  to  decompose 

• 

correlations  among  variables  in  a  structural  model  into  functions  of  path 
coefficients  and,  if  necessary,  unanalyzed  correlations  among  exogenous 
variables  (the  latter  case  is  not  treated  in  our  examples) .  Aside  from  the 
correlation  of  a  variable  with  itself,  there  are  six  population  correlations 
associated  with  Figure  2.16?  namely  £21,  £31,  r32,  r^,  £42'  and 

r43  (the  order  of  subscripts  reflects  causal  priorities,  although  this  is 
arbitrary) .  The  decomposition  may  be  accomplished  by  the  long  method  using 
expectations  (cf.  Duncan,  1975)  or  by  the  short  method  using  the  following 
equation  (Namboodiri  et  al.,  1975): 
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-H  =  Z  £il^si 


(2.18) 


"il  k 

^  represents  the  endogenous  variable  (_i  >  jt,i) ,  refers  to  the 
causal  variable,  and  _k  begins  with  _i-l  and  ranges  dowi  to  1  (i.e.,  z^)  . 
The  short  method  is  employed  below,  where  correlations  of  the  form 
(k  =  j)  are  deleted  because  they  are  equal  to  1.0. 


Insert  Figure  2.16  about  Here 


Direct  application  of  Eq.  2.18  provides: 
-21  =  £21 


(2.19) 


-31  =  £32^21  +  £31 
-32  =  £32  +  £31-12 


^41  =  £i>£31  +  ^42-21  +  £41 
^42  =  £43^32  +  £42  +  £41-12 
£43  *  £43  *  £42-23  +  £41-13 


Eqs.  2.19  through  2.24  do  not  represent  a  full  decomposition;  they  are 
the  normal  equations  used  to  solve  for  the  path  coefficients  in  Bqs.  2.16a 
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through  2.16c  in  Figure  2.16.  Dotted  lines  separate  the  normal  equations 
associated  with  each  path  equation.  As  discussed  in  regard  to 
identification  in  Condition  9,  it  is  possible  to  compute  the  six 
correlations  directly  from  the  data  and  thus  the  correlations  are  "knovm 
values".  There  are  also  six  unknown  path  coefficients.  When  the  number  of 
known  values  (normal  equations)  for  each  path  equation  is  equal  to  the 
number  of  unknown  values,  each  path  equation  is  referred  to  as  just 
identified  or  exactly  identified.  If  all  equations  are  just  identified, 
then  the  path  model  is  said  to  be  just  identified  (exactly  identified) . 

This  means  that  one  has  a  sufficient  number  of  knowns  (six  correlations)  to 
estimate  the  number  of  unknowns  (six  path  coefficients)  in  the  path  model. 
In  just  identified  models,  all  of  the  known  information  is  employed  to 
calculate  the  unknown  path  coefficients. 

Assuming  that  OLS  has  justifiably  been  used  to  solve  for  the  path 
coefficients  (i.e.,  the  jr ^  and  population  OLS  beta-weights  are 
equivalent)  ,  we  can  now  fully  decompose  the  correlations  by  replacing 
the  correlations  on  the  right  side  of  Bqs.  2.19  through  2.24  with  their 
decompositions  from  earlier  equations.  For  example,  *n  BJ*  2.20  and 
£12  in  Eq.  2.21  are  equal  to  p21  (Eq.  2.19).  ^jplying  this  process 
to  the  remaining  equations  results  in  the  following  set  of  decomposed 
correlations. 


-21  =  £21 

(2.25) 

hi  *  P32E21  +  P31 

(2.26) 

-32  *  £32  +  £31%1 

(2.27) 
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£41  =  £43£32£21  +  ^43^31  +  -24^21  +  %1 
±42  =  %3^32  +  %3^31£21  +  -%2  +  %1£21 
£43  =  £43  +  £42^32  +  +  -^41^32^21  +  -E41E3I 


The  objective  of  these  "decomposition  equations"  is  to  see  if  one  can 
reproduce  the  correlations  with  functions  of  path  coefficients.  This  is 
clearly  circular  for  these  equations.  That  is,  if  all  the  known 
information  (correlations)  is  used  to  solve  for  the  unknown  information 
(path  coefficients)  in  the  normal  equations  (Eqs.  2.19  through  2.24) ,  then 
it  should  be  possible  to  reverse  the  process  and  reproduce  the  correlations 
exactly  by  functions  of  the  path  coefficients.  This  is  precisely  the  case; 
Eqs.  2.25  through  2.30  will  reproduce  exactly  the  original  six 
correlations  used  to  solve  for  the  path  coefficients  in  Eqs.  2.19  through 
2.24.  Thus,  in  just  identified  models,  goodness  of  fit  tests  are 
inappropriate  inasmuch  as  solutions  of  path  coefficients  and  reproduction 
of  correlations  are  circular  or  mirror  reflections  of  the  same  statistical 
process. 

It  is  possible  to  break  out  of  this  circular  process  by  omitting  one 
or  more  causal  connections  from  the  path  model  and  path  equations.  This  is 
accomplished  by  assuming  a  priori  that  at  least  one  path  coefficient  is 
equal  to  zero;  that  is,  a  does  not  cause  a  (£  >  j)  directly. 

For  example,  in  Figure  2.16,  suppose  it  is  assumed  that  does  not 
cause  z^  directly.  Mathematically  this  is  equivalent  to  assuming  that 
jo^  =  0.  The  new  path  equation  for  is  now 


(2.31) 


%  =  £ii~i  +  -B42?’2  +  £4 

The  effects  of  this  assumption  may  be  demonstrated  with  the  normal 
equations  (cf.  Duncan,  1975),  although  it  is  straightforward  to  deal 
directly  with  the  decomposition  equations,  lb  see  the  results  of  the 
assumption  that  =  0,  we  will  emit  all  terms  in  Bqs.  2.25  through 
2.30  that  involve  J^-j.  The  terms  to  be  omitted  have  a  line  crossed 
through  them. 


-21  ~  £21 


-31  "  ^32^21  +  £.31 


-32  £32  +  ^31£21 

£41*  "  ^3^^21  +-%^31  +  £42^21  +  £41 
■^42*  =  +  £43%^21  +  %2  +  %l£21 


£43*  V£tff  +  £42^32  +  £42£3l£21  +  £4l£32£21  +  £4l£31 


(2.32) 

(2.33) 

(2.34) 

(2.35) 

(2.36) 

(2.37) 


Of  initial  importance  is  the  fact  that  there  are  still  six  known 
correlations.  However,  only  five  unknowns  now  exist  (i.e.,  jo^  is 
considered  known  and  eqAted  to  zero)  .  Vhen  more  knowns  exist  than 
unknowns  in  the  noqj^l  and  decomposition  equations,  the  model  is  said 
to  be  overtflenMfied .  (Technically,  the  path  equation  is 
over  identified  because  z ^  was  deleted  from  Bg.  2.31). 

O/er identification  has  a  number  of  technical  implications;  however,  our 
concern  here  is  with  the  substantive  outcome.  This  outcome  is  that  (a)  any 
correlation  in  the  set  of  six  correlations  represented  by  Eqs.  2.32 
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equation  (Eq.  2.17c)  .  The  decomposition  equations  are  showi  in  Figure 
2.17c.  All  terns  involving  have  a  line  crossed  through  them  (i.e.,  are 
omitted),  which  results  in  r^*,  I42*'  I43*  b^ng  subject  to  an 
overidentifying  restriction.  TWo  different  hypothetical,  population 
correlation  matrices  for  the  model  in  Figure  2.17a  are  show*  in  Figure  2.17d. 
Given  that  all  conditions  and  assumptions  for  confirmatory  analysis  and  OLS 
have  been  reasonably  satisfied,  the  population  OLS  solutions  of  the  path 
coefficients  for  each  of  these  correlation  matrices  are  reported  in  Figure 
2.17e.  (Only  the  correlations  differ;  the  path  model  is  the  same  for  both 
correlation  matrices) .  The  observed  correlations  subject  to  an  over  identifying 
restriction  and  the  reproduced  correlations,  based  on  the  decomposition 
equations  in  Figure  2.17c,  are  presented  in  Figure  2.17f.  For  matrix  A,  the 
observed  and  reproduced  correlations  are  identical,  which  suggests  that  (a) 

*  0;  (b)  the  path  model  in  Figure  2.17a  has  a  good  fit  with  the  data  — 
that  is,  the  correlations  in  Matrix  A;  and  (c)  the  path  model  is  confirmed  in 
regard  to  the  second  test  of  confirmation  /  disconfirmation.  For  Matrix  B, 
however,  discrepancies  exist  between  the  observed  and  reproduced  correlations. 
These  results  connote  that  (a)  is  not  equal  to  zero  and  the  terms 
crossed  out  of  the  decomposition  equations  for  r^*,  £42*'  £43*  should 
have  been  retained;  (b)  the  path  model  in  Figure  2.17a  does  not  have  a  good  fit 
with  the  data  represented  by  Matrix  J3;  and  (c)  the  path  model  is 
disconfirmed. 

Insert  Figure  2.17  about  Here 

To  sunmarize,  the  a  priori  prediction  that  =  0  in  Figure  2.17a 
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through  2.37  that  has  a  term  crossed  out  (i.e.,  Bqs.  2.35,  2.36,  and  2.37) 
is  (b)  not  constrained  to  be  exactly  reproduced  by  the  path  coefficients. 

In  more  specific  terms,  r^*,  £42*'  and  £43*  are  sub3ect  to  30 
over  identifying  restriction  (i.e.,  p^  -  0),  and  the  reproduced 
(sometimes  referred  to  as  impl ied)  correlations  resulting  from  Bqs.  2.35 
through  2.37  may  differ  from  their  observed  couiterparts  (r^,  r^* 
r^) .  Asterisks  are  used  to  designate  correlations  subject  to 
over  identifying  restrictions. 

The  hypothesis  that  =  0  leads  to  the  predictions  that  r.3*, 
r^2*»  and  £43*  will  be  equal  to  their  observed  counterparts,  namely 
^41'  £42'  £43*  Consequently,  if  the  reproduced  correlations  are 
equal  to  their  observed  counterparts,  then  the  predictions  are  confirmed. 
This  implies  that  £^  is  equal  to  zero.  Suppose,  however,  that  the  a 
priori  hypothesis  that  p^  =  0  is  wrong,  and  jo43  >0.  In  this 
situation,  r^ *  ¥  r^,  r^*  ¥  £43,  and  ^  ¥  £43.  For 
example,  if  all  correlations  and  path  coefficients  are  positive,  then  £43* 
will  underestimate  r^  by  a  value  equal  to  .E43E32E21  +  -E43H31' 
the  terms  deleted  from  Bq.  2.35.  Thus,  the  goodness  of  fit  test  leads  to 
disconf innation  of  the  predictions  evolving  from  a  model  with  £^3  =  0.  The 
implication  of  disconf irmation  is  that  z 3  has  a  direct  effect  on  z^. 

We  will  now  demonstrate  the  above  rationale  with  an  empirical 
illustration,  using  the  prediction  that  =  0  in  place  of  £^3  =  0. 

Figure  2.17  presents  a  path  model  (Figure  2.17a)  and  path  equations 
(Figure  2.17b)  that  are  similar  to  the  model  and  equations  in  Figure  2.16, 
except  for  the  fact  that  £^3  is  predicted  to  be  equal  to  zero  in  Figure 
2.17a.  Consequently  the  £^£3  tem  *s  deleted  from  the  z^  path 
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resulted  in  an  overidentification  condition  in  vdiich  three  correlations 


were  subject  to  an  over identifying  restriction.  In  other  words,  three 
reproduced  correlations  were  not  constrained  to  be  equal  to  their  observed 
counterparts,  although  they  were  predicted  to  be  equal  to  their  observed 
counterparts.  The  correlations  subject  to  the  overidentifying  restriction 
may  be  ascertained  by  deriving  the  decomposition  equations  using  Eq.  2.18 
for  a  just  identified  model  (i.e.,  all  paths  included) ,  and  then  crossing 
out  those  terms  in  the  decomposition  equations  that  involve  a  path 
coefficient  assumed  to  be  equal  to  zero.  Reproduced  correlations  may  then 
be  calculated  and  compared  to  observed  correlations.  Divergence  of 
reproduced  from  observed  correlations  disconfirms  predictions  regarding  the 
reproduced  correlations  and  implies  that  a  priori  hypotheses  are 
incorrect.  Reproduced  correlations  equal  to  observed  correlations  denote 
confirmation  of  predictions  and  imply  that  the  a  priori  hypotheses  may  be 
valid. 


Note  that  only  correlations  subject  to  overidentifying  restrictions 
should  enter  into  goodness  of  fit  tests.  It  would  make  no  sense  whatever 
to  use  £21,  £31,  o£  £32  in  Figure  2.17  as  a  basis  for  testing 

goodness  of  fit  inasmuch  as  these  correlations  will  be  reproduced  exactly. 


Moreover,  the  assumption  that  =  0  must  be  made  a  priori .  lb 

reiterate  a  point,  consider  a  scenario  in  which  a  just  identified  model  is 


used  to  calculate  the  values  of  all  path  coefficients,  and  jo^  is 
found  equal  to  zero.  is  then  deleted  from  the  model,  and  the  model 

is  regarded  as  overidentified.  A  goodness  of  fit  test  of  predictions 


evolving  from  jo^  =  0  could  not  miss  because  one  would  know  that  the 
calculated  £„  =  0.  Thus,  the  reproduced  correlations  would  equal 
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the  observed  correlations.  This  is  obviously  not  a  legitimate  goodness  of 
fit  test,  and  the  time  has  arrived  for  authors  and  reviewers  to  recognize 
that  the  researcher  must  test  the  goodness  of  fit  of  the  model  he  or  she 
begins  with  and  not  the  model  obtained  by  theory  trimming  (i.e.,  after 
seeing  the  data) . 

The  preceding  discussion  was  based  on  population  values.  With  sample 
values,  goodness  of  fit  tests  are  designed  to  ascertain  whether  reproduced 
correlations  subject  to  overidentifying  restrictions  are  different  than 
their  observed  counterparts.  [A  method  of  estimation  other  than  OLS 
(e.g.,  maximum  likelihood)  must  be  used  to  estimate  structural  parameters 
if  a  significance  test  is  desired  in  the  comparison  of  the  reproduced  and 
observed  correlations.]  However,  the  comparison  between  reproduced  and 
observed  correlations  is  often  not  recommended  for  actual  practice  with 
manifest  variable  designs  because  (a)  it  is  complex  statistically,  (b)  it 
is  difficult  to  interpret  when  more  than  one  path  is  omitted  from  a  model, 
and  most  importantly  (c)  the  omitted  parameter  test  is  a  more  direct  method 
that  is  easier  both  to  compute  and  to  interpret  (cf.  IXincan,  1975; 

Namboodiri  et  al.,  1975).  We  illustrate  the  difficulty  of  interpretation 
issue  below,  and  then  demonstrate  how  it  is  possible  to  address  the  same 
issue  with  the  omitted  parameter  test  in  a  more  direct  manner. 

Consider  the  model  in  Figure  2.18,  which  predicts  that  both  £32 
and  are  equal  to  zero.  The  path  equations  are  as  follows: 

-2  =  -221-1  +  -2  (2.38) 


-3  =  £31-1  +  -3 


(2.39) 
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*4  =  £4#2  +  *4^3  +  ^4 


(2.40) 


Insert  Figure  2.18  about  here 


If  the  model  were  just  identified,  the  decomposition  equations  would  be 
the  same  as  Bqs.  2.25  through  2.30.  These  equations  are  show  below,  where 
all  terms  involving  p32  and  p^  have  been  crossed  out.  With  these 
terms  omitted,  there  are  six  known  correlations  and  four  unknown  path 
coefficients.  Thus,  the  model  is  overidentified.  Once  the  four  path 
coefficients  are  solved  for  in  the  population  (estimated  in  a  sample) ,  all 
correlations  except  p12  are  shown  to  be  subject  to  an  over identifying 
restriction. 

-21  =  £21 

-31*  =  ^32^21  +  -Ell 

-32*  =^f+  ^31^21 

•%1*  =  %3%2^21  +  ^43^31  +  ^42^21  +^%f 
U2  =  J4^32  +  -243£31%1  +  £42  1 

£43*  35  P43  +<^4^32  +  -£42£3l£21  +  ^3^3^21 

Suppose  all  £^*  are  equivalent  to  their  observed  cow  ter  parts. 

This  denotes  that  predictions  regarding  reproduced  correlations  are 
confirmed,  and  implies  that  p32  =  p41  =  0.  Now,  suppose  p31*  = 


(2.41) 

(2.42) 

(2.43) 

(2.44) 

(2.45) 

(2.46) 


* 


but  r^*,  £42*'  an^  £43*  are  not  ec?ua-*-  to 


—31  and  -32*  -  —32' 


their  observed  counterparts.  This  pattern  suggests  that  (a)  £^2  is  equal 
to  zero,  and  (b)  £^  is  different  from  zero.  That  is,  the  .r.  >  involving 
only  £^2  led  to  accurate  reproduced  correlations,  while  the  jr. j* 
involving  £41  (and  £^2)  did  not.  Predictions  regarding  correlations 
containing  are  disconfirmed  because  £^  is  presumably  not  equal  to  0. 


Finally,  suppose  that  all  of  the  £. >  diverge  from  the  observed 

— ij*  — 31* /  —31  and  —32*  ^  —32  that  £32  /  0 •  However, 

the  £4^*  /  £4 j  (j.  =  1,2,3)  may  be  a  result  of  either  (a)  £^2  f  0 

(which  appears  to  be  the  case)  or  (b)  £^2  and  £^  /  0.  That  is,  we  can 
not  infer  whether  £^  is  equal  or  not  equal  to  zero  with  this  test.  This 
is  hardly  a  trivial  concern  if  one  wishes  to  ascertain  sources  of 
discontinuation.  As  a  result,  the  reproduced  correlation  test  applied  to  a 
single  model  is  of  questionable  use  in  actual  practice  with  moderately  to 
highly  overidentified  manifest  variable  designs.  Duncan  (1975)  and  Bentler 
(1980)  review  other  problsns  of  a  statistical  nature.  A  refinement  of  this 
test  has  been  suggested  for  latent  variable  analysis,  and  will  be  discussed 
in  that  section. 


The  Quitted  Parameter  Test 


The  logic  of  the  reproduced  correlation  test  is  that  if  the  values  of 
£^  assumed  equal  to  zero  are  in  fact  zero,  then  reproduced  correlations 
subject  to  over identifying  restrictions  should  be  equal  to  their  observed 
counterparts.  It  would  seem  that  one  could  save  considerable  fuss  by 
simply  estimating,  in  a  sample,  the  values  of  the  £^  assumed  equal  to 
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zero  to  see  if,  in  fact,  they  do  not  differ  significantly  from  zero.  That 
is,  if  a  path  coefficient  is  hypothesized  to  equal  zero,  then  one  would 
predict  that  its  estimated  value  should  not  differ  significantly  from 
zero.  If  the  estimated  path  coefficient  is  not  significantly  different 
from  zero,  then  the  prediction  is  confirmed.  If  it  is  significantly 
different  from  zero,  then  the  prediction  is  discontinued.  This  is  the 
omitted  parameter  test,  and  is  logically  equivalent  to  the  reproduced 
correlation  test.  More  importantly,  it  provides  a  method  to  locate  sources 
of  discontinuation  in  recursive  models. 

To  begin  the  illustration,  let  us  return  to  Figure  2.17,  where  it  was 
shown  that  the  (population)  path  model  did  not  have  a  good  fit  with  Matrix 
B.  This  implied  that  the  omitted  parameter  £41  was  not  equal  to  zero. 

It  is  possible  to  solve  for  the  value  of  jo^  by  including  £41  in 
the  path  equation  for  (Eq.  2.17c)  and  conducting  a  new  OLS  analysis 
on  the  equation:  =  jg^z^  +  £42-2  +  £43^-3  +  —3’  'I*ie 

population  value  of  given  by  an  OLS  analysis  using  the  correlations 
in  Matrix  B  is  .20,  vAiich  differs  considerably  from  zero.  Thus,  the 
omitted  parameter  test  and  the  reproduced  correlation  test  result  in  the 
same  conclusion  for  this  simple  model.  Note  also  that  if  (a)  the  terms 
deleted  from  the  decomposition  equations  in  Figure  2.17c  are  instead 
retained,  and  if  (b)  the  value  of  .20  is  used  for  then  (c)  the 
observed  and  reproduced  correlations  for  r^,  r^r  and  r^  are 
identical . 

Things  are  often  not  this  simple.  Let  us  continue  with  a  discussion 
of  the  omitted  parameter  test,  where  sample  data  are  now  assuned  because  we 
wish  to  address  significance  tests.  Consider  the  path  coefficients  in  the 
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original  path  equations  for  Figure  2.18  (Eqs.  2.38  through  2.40).  We  will 
presume  that  the  values  of  these  coefficients  are  estimated  using  OLS.  The 
goodness  of  fit  test  is  then  conducted  by  inserting  the  variables  with  path 
coefficients  predicted  to  equal  zero  into  the  appropriate  equations  and 
conducting  new  OLS  analyses.  In  the  present  case,  OLS  would  be  conducted 
to  estimate  the  path  coefficients  in  the  following  equations  (the  z 
equation  does  not  have  an  omitted  parameter) : 


—3  -  P31-J.  +  £32%  +  — 2  (2*47) 

I4  =  %  *^1  +  £42^2  +  %^3  +  £4  (2‘48) 

where  asterisks  indicate  the  path  coefficients  that  should  be  equal  to  zero, 

and  circunflexes  denote  that  we  are  dealing  with  sample  estimates  of  path 

coefficients  (and  disturbances) . 

A  A 

If  P32*  and  P41*  are  not  significantly  different  from  zero, 
then  the  predictions  are  confirmed.  Confirmation  implies  that  the 
parameters  or  path  coefficients  p^  and  p^  are  zero  and  that  this 
part  of  the  original  path  model  is  valid.  That  is,  the  omitted  paths 

A  A 

should  have  been  omitted.  If  £32*  and/or  p^*  is  significant,  then 
at  least  one  prediction  is  discontinued.  Discontinuation  implies  that  the 
original  path  model  is  invalid,  which  is  to  say  that  at  least  one  omitted 
path  should  not  have  been  omitted  (i.e.,  p^2  and/or  p^  is  not 

equal  to  zero) .  It  is  particularly  important  to  note  that  this  process 
allows  one  to  ascertain  precisely  the  source  of  disconfirmation. 

In  samples,  £32*  and  £43*  are  tested  for  significance  using 
standard  tests  for  regression  weights.  There  are,  however,  some  problems 
with  the  use  of  these  significance  tests.  First,  confirmatory  analysis  is 
usually  conducted  on  large  samples.  Thus,  the  test  of  significance  is 
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rather  powerful,  and  paths  with  rather  trivial  magnitudes  (e.g.,  a 
=  .05)  may  be  statistically  significant  and  lead  to  disconfirmation.  There 
are  no  clear  rules  of  thumb  here;  one  may  wish  to  go  beyond  the  data  and 
argue  that  a  significant  but  very  low  path  coefficient,  less  than  |+ . 05j  for 
exanple,  does  not  do  serious  damage  to  a  model.  This  is  a  matter  of 
judgment,  however,  and  the  author  must  have  a  strong  theoretical  defense  in 
support  of  such  a  decision. 

A  second  problem  is  multicoil inear ity,  which  occurs  when  two  or  more 
variables  in  the  same  equation  are  correlated  highly  with  one  another  (see 
Johnston,  1972,  and  Gordon,  1963,  for  discussions  of  indicators  of 
mul ticol linear ity) .  A  key  product  of  multicoil inear ity  is  that  "it  becomes 
very  difficult,  if  not  impossible,  to  disentangle  the  relative  influences 
of  the  various  X  [causal]  variables"  (Johnston,  1972,  p.  160).  In 
effect,  one  can  place  little  faith  in  sample  estimates  of  structural 
parameters  and  significance  tests  (cf.  Gordon,  1968).  The  probability  of 
a  multicoil inear ity  condition  is  of  serious  concern  in  omitted  parameter 
tests  (James  &  Jones,  1980).  Consider,  for  example,  Eq.  2.48,  where 

A 

is  to  be  tested  for  significance.  Variable  z^  is  included  in 
an  equation  with  two  variables  for  which  it  is  a  cause,  namely  z 2  and 
z  ,  If  z^  is  a  strong  cause  of  z2  and  Zy  then  z^  will  be 
correlated  highly  with  z 2  and  Zy  It  is  possible,  therefore,  that  the 
equation  used  for  the  omitted  parameter  test  (Eq.  2.48)  ,  which  considers, 

z_y  and  as  causes,  could  be  subject  to  a  multicoil  inear  ity 
condition.  Given  multicollinearity,  one  has  several  options,  including  (a) 
hierarchical  regression  (cf.  Cohen  &  Cohen,  1975)  or  (b)  a  disturbance  term 
regression  test  described  in  James  and  Jones  (1980).  Inasmuch  as  these  tests 
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concern  primarily  statistical  issues,  they  are  not  discussed  here. 

A  final  issue  is  protection  of  the  significance  level  when  multiple, 
nonindependent  significance  tests  are  conducted  on  the  same  sample.  This 
issue  has  not  been  a  concern  in  manifest  variable  analyses,  although  it 
should  be  in  both  Conditions  9  and  10.  We  will  treat  this  issue  briefly  in 
the  section  on  latent  variables. 

A  Combination  of  the  First  and  Second  Ttests  of  Confirmation/Di sconfirmation 

In  the  preceding  condition  (Condition  9)  we  described  how  it  is 
possible  to  confirm/disconfirm  predictions  furnished  by  structural  (path) 
models  and  functional  (path)  equations.  This  first  test  of  confirmation/ 
disconf innation  addresses  predictions  that  estimated  values  of  structural 
parameters  (path  coefficients)  included  in  functional  equations  should  be 
significantly  different  from  zero.  The  second  test  of  confirmation/ 
disconf innation  described  in  the  present  section  addresses  predictions  that 
estimated  values  of  structural  parameters  (path  coefficients)  not  included 
in  functional  equations  (i.e.,  omitted  parameters)  should  not  be 
significantly  different  than  zero.  The  strongest  case  for 
confirmatiorx/disconfirmation  is  made  if  both  tests  are  conducted,  which 
requires  at  least  some  over identified  equations  to  conduct  the  second  test. 

lb  review  how  the  tests  are  combined,  consider  the  path  equation  for 
in  Figure  2.18,  vrfiich  may  be  viewed  as: 

U  *  <P_41*  =  °>4  +  £42*2  +  £4^3  +  £4  (2-49) 


=  £4^2  +  £43^3  +  £4 


(2.50) 
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The  first  test  of  confirmation  /  disconfirmation  tests  the  predictions 

A  A, 

that  the  sample  OLS  estimates  of  arv^  P43*  or  P42  anc*  P43' 
are  significant.  If  both  estimates  are  significant,  we  have  partial 
confirmation  for  this  structural  equation.  The  term  "partial  confirmation" 
is  used  because  the  values  of  p^  and  p^  are  estimated  conditional 

/N  A 

on  the  assumption  that  p^  =0.  If  p42  and/or  p43  is 
nonsignificant  (the  interpretation  of  significance  is  left  to  the 
investigator) ,  then  this  functional  equation,  and  therefore  the  structural 
model,  are  disconfirmed,  although  we  recommend  that  one  still  proceed  to  the 
second  test  of  confirmation/disconfirmation.  The  second  test  concerns  the 
prediction,  based  on  the  assumption  that  p^*  =  0,  that  is  not 

significant.  The  test  requires  another  OLS  analysis  to  obtain  a  value  of 

A  ^  A 

p^.  If  p^*  is  not  significant,  then  the  prediction  has  been 
confirmed  and  the  functional  equation  is  said  to  be  (logically)  consistent 

A  -a. 

with,  or  to  have  a  good  fit  with,  the  data,  given  that  p42  and  p43 
are  significant.  If,  however,  (a)  either  p42  or  p43»  or  both,  is 
nonsignificant,  and/or  if  (b)  is  significant,  then  (c)  the 

functional  equation  and  the  model  are  disconfirmed.  Either  source  of 
disconfirmation  (i.e.,  the  first  or  second  test)  implies  that  the  equation 
and  the  model  are  logically  inconsistent,  or  fail  to  have  a  good  fit,  with 
the  data. 

To  summarize,  if  all  functional  equations  in  a  structural  model  pass  the 
first  test  of  confirmation/disconfirmation,  and  if  all  over  identified 
functional  equations  pass  the  second  test  of  confirmation/disconfirmation. 


then  the  structural  model  is  said  to  have  a  good  fit  with  the  data,  or  to  be 
logically  consistent  with  the  data.  We  can  say,  therefore,  that  the 
structural  model  is  confirmed  because  it  has  passed  both  tests  required  for 
confirmation.  However  we  can  not  conclude  that  the  structural  model  v*iich 
generated  the  predictions  is  a  "true  model".  By  true  we  mean  that  the  model 
and  functional  equations  accurately  represent  causal  processes.  On  the  other 
hand,  if  a  structural  model  is  disconfirmed  by  either  test,  we  have  a  strong, 
but  not  necessarily  unequivocal,  case  for  concluding  that  the  model  is  false. 
These  points  are  discussed  in  the  next  section,  where  we  address  "causal 
inference" . 

3.  CAUSAL  INFERENCE  WITH  MANIFEST  VARIABLES 

"The  insidious  thing  about  the  causal  point  of  view 
is  that  it  leads  us  to  say:  'Of  course,  it  had  to 
happen  like  that.'  Whereas  we  ought  to  think:  It 
may  have  happened  like  that — and  also  in  many 
other  ways." 

—  Ludwig  Wittgenstein  (1980) 

We  begin  this  section  by  restating  the  third  principle  for  the  logical 
foundation  of  confirmatory  analysis  (the  principles  were  presented  in 
Condition  9,  Section  2).  The  principle  is: 

3.  Confirmation  of  predictions  implies  corroborative  support 
for  the  structural  model  represented  by  the  functional  relations 
and  equations.  Disconfirmation  of  predictions  implies  that  one 
or  more  components  of  the  structural  model  (functional  relations 


and  equations)  is  false,  in  which  case  it  is  concluded  that  the 
structural  model  as  proposed  originally  is  invalid. 


We  focus  first  on  confirmation.  The  corroborative  support  provided  by 
confirmation  implies  completion  of  the  following  progression:  (a)  causal 
hypotheses  have  been  made  and  formally  specified  in  a  structural  model  and  in 
functional  equations;  (b)  conditions  pertaining  to  the  appropriateness  of 
theoretical  models  (Conditions  1  through  7)  and  operationalization  of 
variables  (Condition  8)  have  been  reasonably  satisfied;  (c)  a  set  of 
predictions  regarding  significance  of  structural  parameters  and/or 
reproduction  of  a  correlation  variance-covariance)  matrice  have  been  derived 
from  the  structural  model  and  functional  equations;  and  (d)  the  predictions 
have  been  confirmed  by  empirical  tests.  This  progression  suggests  that  one  is 
in  the  position  to  make  the  judgment,  or  causal  inference,  that  the  causal 
hypotheses  on  viiich  the  structural  model  and  functional  equations  are  based 
are  useful  for  explaining  how  and  why  the  endogenous  variables  in  the 
model  occurred  and  are  related  to  other  endogenous  as  well  as  exogenous 
variables.  Explanation  is  possible  because  structural  models  and  functional 
equations  provide  explicit,  quantitative  statements  of  theory  that, 
presumably,  specify  the  rules  that  govern  the  occurrences  of  natural  events 
and  the  structure  of  observed  relationships  among  naturally  occurring  events 
(Heise,  1969) .  Confirmation  of  a  structural  model  suggests  that  these  rules 
are  useful  and  may  be  used  to  infer  the  causal  processes  that  are,  and  were, 
operating. 

One  must  be  mindful  that  it  is  necessary  to  make  a  number  of 
untestable  assumptions  in  the  process  of  conducting  a  confirmatory 
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analysis.  By  untestable  assumptions  we  mean  that  reasonable  satisfaction 
of  subsets  of  conditions  from  Conditions  1  through  8  is  based  on  "faith" 
and  not  empirical  tests.  For  example,  with  cross-sectional  data  one  cannot 
test  the  stability  condition  (Condition  7) ,  operationalize  a  cyclical 
recursive  model  (Condition  4),  or  attempt  to  control  bias  resulting  from  an 
unmeasured  variables  problem  (Condition  5) .  Consequently,  reasons  must  be 
offered  for  assumptions  that  the  model  is  stable  and  in  an  equilibrium- type 
condition,  that  causal  relations  are  noncyclical  within  a  temporal  bound, 
and  that  the  unavoidable  unmeasured  variables  problem  is  not  sufficiently 
serious  to  preclude  confirmatory  analysis.  Prior  research  may  furnish  a 
basis  for  assumptions,  but  an  investigator  cannot  avoid  the  fact  that 
he/ she  is  relying  on  faith  when  he/she  assumes  that  untested  conditions  are 
reasonably  satisfied. 

The  issue  of  reasonable  satisfaction  is  also  important  in  regard  to 
conditions  that  can  be  tested  empirically.  It  is  usually  necessary  to  rely 
on  reasonable  satisfaction,  rather  than  perfect  satisfaction,  of  conditions 
or  subsets  of  conditions  that  can  be  subjected  to  empirical  tests.  For 
example,  perfect  reliability  is  seldcm  attained  in  manifest  variable 
studies.  As  discussed  in  Condition  8,  high  reliability  is  generally 
considered  sufficient  for  reasonable  satisfaction  of  the  reliability 
assumption,  although  no  unequivocal  definition  exists  f«r  "high",  and  the 
effects  of  measurement  error  in  elaborate  models  is  often  unpredictable. 
Other  assumptions  that  are  potentially  testable,  but  for  which  reasonable 
satisfaction  is  considered  sufficient,  include  the  construct  validity  of 
manifest  variables,  interval  measurement  scales,  and  linearity.  Although 
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reasonable  satisfaction  of  empirically  testable  assumptions  is  considered 
sufficient  for  proceeding  with  confirmatory  analysis,  the  cumulative 
effects  of  various,  presumably  nonserious,  violations  are  essentially 
unknown . 

Untested  assumptions,  in  combination  with  reasonable  rather  than 
perfect  satisfaction  of  tested  assumptions,  suggest  that  confirmation  of  a 
structural  model  is  not  synonymous  with  proof  that  the  model  is  true  or 
correct.  Confirmation  implies  only  that  those  assumptions  that  were  tested 
by  empirical  analysis  provided  corroborative  support  for  the  structural 
model.  This  support  may  disappear,  however,  if  an  untested  assumption  is 
shown  later  to  be  false,  or  if  what  appears  to  be  reasonable  satisfaction 
of  an  empirically  tested  assumption  is  in  fact  unreasonable.  These  points 
lead  us  to  the  crucial  conclusion  that  ^t  .is  possible  to  confirm  a  false 
structural  model.  On  the  other  hand ,  rt  is  also  possible  to  disconfirm  a 
true  model.  For  example,  a  nonsignificant  parameter  estimate  between 
variables  thought  to  be  causally  connected  may  be  due  to  imprecision  in  the 
measurement  of  the  variables  rather  than  lack  of  true  causal  connection 
(cf.  Cook  &  Campbell,  1979).  Thus,  disconfirmation  is  not  necessarily 
synonymous  with  disproof  of  a  structural  model,  although,  as  discussed 
shortly,  disconfirmation^  in  comparison  to  confirmation,  has  stronger 
implications  for  the  usefulness  of  a  structural  model. 

Before  contrasting  confirmation  and  disconfirmation,  allow  us  to 
furnish  a  simple  illustration  of  how  it  is  possible  to  confirm  a  false 
structural  model.  Consider  Figure  3.1,  which  presents  a  false  path  model 
and  a  true  path  model  for  a  manifest  variable  design.  The  difference 
between  the  models  is  that  the  true  model  includes  an  additional,  exogenous 
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cause  of  that  is  correlated  with  z^,  but  not  with  z^.  This 
suggests  that  z  is  an  unmeasured  relevant  cause  of  z,  in  the  false 
model,  which  implies  that  Condition  5  (self-containment)  is  violated  in  the 
z_ ^  equation.  However,  note  the  predictions  for  the  two  models.  Both 
the  false  and  true  models  predict  that  p2^  and  p22  ^ 

significant  (first  test  of  confirmation/disconfirmation  discussed  in 
Section  2,  Condition  9).  The  difference  between  the  models  is  that  p^2 
in  the  false  model  will  be  biased  in  an  upward  direction  due  to  lack  of  a 
control  for  z^  (assuming  relations  are  linear  and  positive,  and 
disregarding  suppressors)  .  Furthermore,  both  models  predict  that  P^* 
will  be  nonsignificant  (second  test  of  confirmation/disconfirmation 
discussed  in  Section  2,  Condition  10) .  The  consequence  of  this  situation 
is  that  the  true  model  furnishes  no  clear  case  for  discontinuing  the  false 
model  if  one  were  to  employ  the  false  model  in  a  confirmatory  analysis.  In 
other  words,  given  that  the  true  model  is  in  fact  true,  the  false  model 
would  be  confirmed. 


Insert  Figure  3.1  about  here 


Numerous  other  illustrations  could  be  used  to  demonstrate  how  it  is 
possible  to  confirm  false  models  (cf.  Duncan,  1975).  The  simple  example 
in  Figure  3.1  makes  the  point  that  confirmation  does  not  imply  truth  in  a 
structural  model.  We  should  also  mention  that  this  point  is  often 
characterized  in  a  different  manner  in  the  confirmatory  literature.  The 
characterization  is  as  follows:  (a)  a  structural  model  will  generate  one 
and  only  one  set  of  correlations  ( var iances-covariances)  among  manifest 
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variables,  although  (b)  a  particular  correlation  matrix  may  be  generated  by 
more  than  one  structural  model.  Thus,  two  or  more  structural  models  may 
generate  the  same  correlation  matrix,  and  demonstration  that  a  model 
reproduces  a  correlation  matrix  does  not  imply  that  it  is  a  true,  or  a 
unique,  structural  model.  This  characterization  may  be  tied  to  our  example 
in  Figure  3.1  by  noting  that  £32*  =  0  implies  that  r^  =  £21^32 

in  both  the  true  and  false  models  (given  that  and  are 
unrelated  in  the  true  model) . 

This  problem  has  been  a  subject  of  concern  in  the  philosophical 
literature,  as  well  as  to  us  in  advocating  confirmatory  analysis.  We  will 
now  draw  from  the  philosophical  and  other  literatures  to  explore  this  issue 
further,  our  objectives  being  to  point  out  a  logical  asymmetry  between 
confirmation  and  disconfirmation,  to  discuss  the  role  of  causal  inference 
in  science,  and  to  suggest  approaches  to  confirmatory  analysis  that  furnish 
a  comparatively  stronger  base  for  causal  inference. 

Returning  to  the  question  of  confirmation,  we  have  endeavored  to  point 
out  that  confirmation  of  predictions  regarding  the  signficance  (first  test) 
and  nonsignificance  (second  test)  of  estimated  structural  parameters,  where 
predictions  were  deduced  for  causal  hypotheses,  can  not  be  used  to 
establish  the  truth  of  the  causal  hypotheses.  To  maintain  fidelity  with 
the  philosophical  literature,  we  shall  refer  to  predictions  pertaining  to 
the  significance/nonsignificance  of  estimated  structural  parameters  as 
"correlational  hypotheses".  Thus,  given  confirmation  of  a  correlational 
hypothesis,  to  conclude  that  the  causal  hypothesis  (P)  is  true  because  the 
correlational  hypothesis  (£)  deduced  from  it  is  true  (confirmed)  would 
involve  committing  the  fallacy  of  affirming  the  consequent.  This  fallacy 


occurs  in  arguments  of  the  form,  "P  implies  ^  is  true;  therefore,  P  is 
true".  The  fallacy  lies  in  the  possibility  that  P  logically  may  be  false 
even  though  Q  is  true.  The  only  case  in  which  the  truth  of  P  could  be 
inferred  from  the  truth  of  Q  would  be  if  the  truth  of  P  were  both 
necessary  and  sufficient  for  the  truth  of  Q.  With  empirical  phenomena 
this  is  not  feasible  because  one  can  never  rule  out  the  logical  possibility 
that  other  reasons  exist  for  £  to  be  true  than  those  considered  when 
deducing  that  Q  is  implied  by  P.  Try  as  we  do  to  establish  closed  systems, 
we  never  can  be  certain  that  we  have  done  so.  Cn  the  other  hand,  it  is  not 
a  fallacy  to  conclude  that  a  causal  hypothesis  (P)  is  false  if  the 
correlational  hypothesis  (Q)  deduced  from  it  is  false  (i.e.,  disconfirmed) . 
This  is  the  form  of  the  argunent  known  by  logicians  as  modus  tollens, 
which  has  the  form:  "P  implies  Q:  Q  is  false;  therefore,  P  is  false." 
Note,  however,  that  this  logic  does  not  consider  the  possibility  that  a 
correlational  hypothesis  may  be  falsified  because  of  specification  errors, 
especially  imprecision  in  the  measurement  of  variables.  Thus,  even 
falsification  of  correlational  hypotheses  is  not  unequivocal,  which 
suggests  that  one  should  be  circumspect  about  using  a  single 
disconfirmation  to  falsify  a  causal  hypothesis  (cf.  Cook  &  Campbell,  1979; 
Popper,  1959) 

For  reasons  such  as  those  discussed  above,  the  philosopher  Popper 
(1959)  has  argued  that  scientists  can  never  deduce  the  validity  of  a 
general  proposition  about  the  world  from  the  truth  of  a  proposition 
concerning  a  particular  event  deduced  from  the  general  proposition.  On  the 
other  hand,  Popper  argued  that  one  can  correctly  infer  the  falsity  of  a 
general  proposition  from  the  falsity  of  a  proposition  concerning  a  specific 
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event  deduced  from  the  general  proposition.  Thus,  as  far  as  deductive  logic 
is  concerned,  Popper  argues  that  science  can  only  infer  the  falsity  of 
general  theoretical  propositions  from  particular  events,  never  their 
validity.  Nevertheless,  Popper  recognizes  the  inclination  of  scientists  to 
act  as  if  their  general  theories  are  true  when  empirical  consequences 
deduced  from  them  are  confirmed  in  experience.  He  euphemistically  refers 
to  these  situations  as  those  in  which  theories  are  "corroborated"  by  the 
confirmation  of  their  particular  consequences. 

We  have  already  discussed  the  point  that  the  inference  that  a  causal 
hypothesis  is  true  when  a  consequence  (prediction)  deduced  from  that 
hypothesis  is  confirmed  in  experience  would  involve  the  commission  of  a 
logical  fallacy.  The  logical  possibility  always  exists  that  the  prediction 
deduced  from  the  causal  hypothesis  is  confirmed  for  reasons  other  than 
those  we  considered  Wien  formulating  the  causal  hypothesis.  This 
limitation  on  the  possibility  for  logical  inference  from  empirically 
confirmed  consequences  is,  as  discussed  at  the  early  part  of  this  section, 
a  function  of  the  fact  that  it  is  not  possible  to  test  empirically  all 
conditions  required  for  confirmatory  analysis,  or  to  be  assured  that 
empirically  tested  assumptions  have  been  fully  satisfied.  It  follows 
directly  that  causal  inference  based  on  confirmatory  analysis  must  be 
tenuous,  which  is  to  say  subject  to  alternative  explanations.  In  other 
words,  the  limitation  ccmes  down  to  a  limitation  associated  with  all 
attempts  to  establish  inductive  forms  of  reasoning  where  one  seeks  to 
generalize  from  particulars  in  experience.  Ch  the  other  hand,  the 
inference  that  a  causal  hypothesis  is  false  when  a  prediction  deduced  from 
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that  hypothesis  is  disconfirmed  in  experience  is  not  an  invalid  inference 
as  long  as  one  is  committed  to  act  as  if  the  deduced  consequence  is  false. 
It  is  a  question  of  the  correct  use  of  deductive  logic,  not  whether 
deductive  logic  leads  to  necessary  truths  about  the  world.  Thus,  one 
appears  to  be  in  a  better  position  to  infer  the  falsity  of  causal 
hypotheses  rather  than  to  infer  their  validity  on  the  basis  of  experience. 
This  is  essentially  the  position  advocated  by  Popper  (1959)  and  echoed  by 
others  (cf.  Cook  &  Campbell,  1979;  Duncan,  1975;  Heise,  1975;  Namboodiri 
et  al.,  1975) . 

There  is  something  disturbing,  however,  about  a  position  that  suggests 
that  all  we  can  do  in  science  is  negate  something,  never  affirm  it.  This 
position  seems  to  run  counter  to  our  experiences  in  which  scientific 
theories  are  offered  as  explanations  of  phenomena.  It  runs  counter  to  the 
passion  with  which  we  and  others  affirm  the  truth  of  a  scientific  idea  or 
the  confidence  we  have  when  we  use  laws  of  physics -to  build  bridges  and  to 
operate  space  shuttles.  TP  be  a  scientist  involves  committing  oneself  to 
the  affirmation  of  corroborated  theories  while  at  the  same  time  submitting 
them  continuously  to  empirical  tests. 

In  our  view,  one  must  understand  Popper's  (1959)  emphasis  on  the 
falsification  of  theory  as  the  outcome  of  an  analysis  of  what  could 
rationally  be  concluded  in  attempts  to  validate  scientific  theories. 

Popper  explicitly  ruled  out  as  irrelevant  for  his  analysis  psychological 
( extralog ical)  concerns  such  as  how  theories  might  be  formulated  or 
developed  from  experience.  Presunably,  he  would  also  rule  out  as 


irrelevant  psychological  concerns  about  what  people  would  do  once  they  have 
subjected  a  theory  to  the  test.  Thus,  if  people  act  as  if  a  theory  is 
true  when  it  is  merely  corroborated,  this  must  be  for  Popper  merely  a  quirk 
of  human  behavior  having  no  rational  justification  on  the  basis  of  the 
evidence  used  to  validate  the  theory.  But  being  a  matter  of  extralog ical 
behavior,  it  is  beyond  the  confines  of  his  analysis.  There  can  be  no 
question  that  acting  as  if  a  theory  is  true  vdien  it  has  been  merely 
corroborated — which  we  may  regard  as  a  form  of  inductive  behavior — is  an 
important  aspect  of  the  scientific  enterprise.  Thus,  if  Popper's  account 
of  the  scientific  enterprise  seems  distorted  to  us,  it  is  because  it  is 
actually  limited  to  only  the  logical  aspect  of  the  enterprise. 

It  may  be  disheartening  to  find  that  corroboration  of  a  structural 
model  guarantees  neither  that  the  model  is  unassailably  correct  nor  yields 
a  unique  explanation  of  relations  among  variables  in  the  model.  But,  as  we 
have  endeavored  to  explain,  this  is  the  limitation  of  inductive 
"inference".  What  one  must  keep  in  mind  is  that  the  goal  of  confirmatory 
analysis  in  science  is  to  attempt  to  explain  how  and  why  variables  occur 
and  are  related.  An  intuitive  approach  to  causality  and  causal  explanation 
in  psychological  research  involves  proposing  functional  relations  among 
variables  in  the  form  of  structural  models  (theories)  .  If  the  causal 
explanations  represeted  by  functional  relations  (equations)  are  empirically 
corroborated  by  data,  this  does  not  mean  that  we  will  enter  into  a  state  of 
suspended  animation.  Rather  we  will  seek  to  disseminate  our  inferences 
regarding  causal  explanations  so  that  others  may  test  them.  We  also  seek 
to  apply  the  explanations  to  solve  problems,  a  process  which  furnishes 
additional  sources  of  confirmation  /  disconf irmation.  We  continue  in  these 
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and  other  endeavors  until  some  other  explanation  comes  along  to  replace  the 
present  one,  or  until  we  encounter  one,  and  preferably  more  than  one, 
instance  in  which  it  is  clearly  disconfirmed.  Cne  must  strive  to  obtain 
explanations  for  events,  even  if  to  do  so  involves  ambiguities  and 
pitfalls. 

But  what  happens  if  a  particular  set  of  observations  leads  to  the 
corroboration  of  not  just  one  but  of  two  or  more  distinct  causal  theories? 
Popper  (1959)  considered  this  problem,  and  suggested  that  theories  that  are 
more  easily  falsifiable  are  to  be  preferred  to  those  that  are  less  easily 
falsifiable.  Tt>  illustrate  this  rationale,  consider  that  an  easily 
falsifiable  manifest  variable  model  is  one  in  which  a  reasonably 
parsimonious  structure  is  used  to  explain  a  large  number  of  observed 
correlations  among  manifest  variables.  In  other  words,  the  number  of 
parameters  that  are  predicted  to  equal  zero  is  large  in  comparison  to  the 
number  of  parameters  that  are  free  to  vary  and  to  be  estimated,  which  is  to 
say  that  the  model  is  highly  overidentified.  Thus,  we  have  a  comparatively 
large  number  of  over  identifying  restrictions,  any  one  of  which  could 
disconfirm  the  model.  In  contrast,  if  only  a  few  parameters  are  predicted 
to  equal  zero  and  the  rest  are  left  to  be  estimated,  then  only  a  few 
over identifying  restrictions  remain  with  which  to  test  goodness  of  fit. 
Using  Popper's  suggestion,  the  former  model  should  be  preferred  over  the 
latter  model  because  it  is  more  easily  falsified  (and  parsimonious) .  The 
larger  the  number  of  overidentifying  restrictions  (i.e.,  the  higher  the 
degree  of  over identification) ,  the  more  likely  the  model  is  to  be 
falsified.  (Note  that  we  are  dealing  here  only  with  the  second  test  of 
confirmation/disconfirmation.  This  test  is  considered  by  many  to  be  the 
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more  crucial  test  of  goodness  of  fit  because  nonsignificant,  estimated 
structured  parameters  of  minor  theoretical  relevance  can  be  theory  trimmed 
without  major  theoretical  damage  to  a  model.  Nevertheless,  a 
nonsignificant  estimate  of  a  parameter  of  major  theoretical  significance  is 
seriously  damaging  to  a  theoretical  model  and  results  in  disconfirmation 
based  on  the  first  test  of  confirmation/disconfirmation.) 

A  second  procedure  that  builds  faith  in  a  corroborated  structural  model 
is  to  subject  multiple,  highly  over  identified  structural  models  to 
confirmatory  analysis.  In  general,  psychology  has  no  dearth  of  theories 
regarding  the  causes  of  hunan  events.  In  fact,  the  problem  is  often  too  many 
competing  theories  rather  than  too  few  theories.  Each  theory  represents  an 
alternative  explanation  of  events,  and  the  recommended  procedure  is  (a)  to 
develop  a  highly  overidentified  structural  model  for  each  theory,  where  each 
model  includes  causal  hypotheses  that  conflict  with  those  of  the  other  models; 
(b)  to  subject  each  structural  model  to  a  confirmatory  analysis;  and  (c)  to 
contrast  the  results  of  the  confirmatory  analyses  and  ascertain  which  one  of 
the  structural  models  has  the  best  fit  with  the  data  (i.e.,  provides  the  most 
plausible  explanation) .  It  is  possible,  of  course,  that  two  or  more  models 
will  still  have  essentially  equivalent  fits  with  the  data.  Ibis  is  a 
worthwhile  finding,  however,  because  the  investigator  will  not  now  accept 
blindly  one  model  as  corroborated.  Furthermore,  assuming  that  some  models 
were  disconfirmed,  the  investigator  has  a  basis  for  identifying  a  reduced  set 
of  plausible  models.  The  next  step  is  to  articulate  further  each  plausible 
model  in  the  interest  of  deriving  conflicting  hypotheses  that  can  be  tested  in 
future  confirmatory  analyses. 

In  conclusion,  no  claim  can  be  made  that  causal  inferences  lead  to  unique 
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or  unassailably  correct  causal  explanations.  It  must  be  recognized  that 
accumulation  of  scientific  knowledge  may  provide  a  basis  for  refuting  an 
inference  at  some  future  date.  The  process  of  causal  inference  rests  on  the 
premise  that  we  will  use  the  knowledge  that  is  presently  available  to  attempt 
to  explain  why  psychological  phenomena  occur.  This  is  the  historical  premise 
of  science;  it  is  what  one  does  when  he/ she  behaves  as  a  scientist. 


4.  AN  OVERVIEW  OF  LATENT-VARIABLE  STRUCTURAL  MODELS. 
Introduction 

Up  to  this  point  we  have  discussed  linear  causal  modeling  with 
observed  or  manifest  variables.  We  will  now  consider  the  more  general 
case  in  which  some  of  the  variables  of  a  linear  causal  model  are  latent 
or  unobserved  variables.  As  defined  earlier,  a  latent  variable  is  a 
hypothetical  or  theoretical  construct,  which  is  to  say,  an  unobserved 
variable  presumed  to  exist  within  a  structural  model  but  for  which  direct 
measurements  are  not  available. 

Linear  models  with  latent  and  manifest  variables  are  not  new  in 
psychology.  Psychologists  have  worked  with  such  models  for  over  60  years 
in  connection  with  common  factor  analysis  and  the  classical  theory  of 
reliability  with  its  "true"  and  "error"  scores.  However,  structural 
equation  modeling  with  latent  variables  became  practical  only  after 
statisticians  working  in  the  area  of  factor  analysis  (Jflreskog,  1970) 
saw  ways  to  generalize  these  efforts  to  encompass  linear  structural 
equation  modeling  (Wiley  &  Wiley,  1970;  Keesling,  1972;  Wiley,  1973; 
Jflreskog,  1973,  1977,  1979;  Jdreskog  &  Goldberger,  1975).  These 


developments  in  statistical  theory  provided  the  basis  for  the  development 
and  distribution  of  general  purpose  computer  programs  for  structural 
equation  modeling  with  latent  variables  such  as  Jdreskog  and  Sflrbam's 
(1978)  LISREL  IV  program  and  McDonald's  (1978,  1980)  COS AN  program.  As  a 
result,  increasing  numbers  of  psychologists  are  today  discovering  the 
usefulness  of  structural  equation  models  in  their  research,  while 
increasing  numbers  of  sociologists,  econometricians  and  geneticists  are 
using  structural  equation  models  with  latent  variables. 

Reasons  for  latent  variable  models.  There  are  a  number  of  reasons 
for  considering  the  use  of  latent  variables  in  modeling  causal 
relationships. 

First,  since  most  of  our  theoretical  constructs  are  abstractions,  it 
is  convenient  to  distinguish  between  a  hypothetical  construct  as  a  latent 
variable  and  a  concrete  realization  of  it  in  a  particular  manifest 
variable. 

Second,  the  concrete  realization  of  a  construct  is  nonunique;  the 
same  theoretical  construct  may  be  operationalized  in  any  number  of  hays. 
Thus,  we  can  think  of  a  domain  of  manifest  observed  variables,  all  of 
which  have  in  common  the  same  underlying  construct.  For  example,  general 
intelligence  may  be  measured  by  various  tests — for  example,  the 
Stan ford-Bi net,  the  Wechsler  Mult  Intelligence  Scale  (WAIS) ,  Raven's 
Progressive  Matrices,  and  the  Lorg e-Thorndike  Test.  In  principle,  the 
number  of  ways  in  which  a  construct  may  be  implemented  operationally 
through  manifest  variables  is  unlimited. 

Third,  our  observed  variables  may  involve  fallible  measurements, 
which  means  that  in  addition  to  the  effects  of  the  hypothesized 


constructs,  the  values  of  our  variables  may  also  reflect  errors  of 
measurement  of  an  unsystematic  nature.  Attempts  to  estimate  structural 
parameters  with  manifest-variable  approaches  vtfien  the  variables  contain 
nonnegligible  errors  of  measurement  may  lead  to  serious  bias  in  the 
estimates. 

Fourth,  measurement  of  manifest  variables  may  include  systematic  or 
nonrandom  sources  of  error,  such  as  variation  due  to  method,  context,  or 
person  variables.  These  too,  if  overlooked  or  ignored,  will  lead  to 
serious  bias  in  estimates  of  structural  parameters.  Latent  variable 
approaches  provide  means  to  deal  with  both  random  and  nonrandom 
measurement  errors. 

Finally,  we  may  not  be  able  to  devise  direct,  univocal  measures  of 
the  theoretical  construct  in  question  because  the  effects  of  the 
theoretical  construct  may  never  occur  in  isolation  apart  from  the  effects 
of  other  causal  variables.  For  example,  we  may  never  be  able  to  devise  a 
measure  of  general  intelligence  (conceived  of  as  the  capacity  to  perceive 
relationships)  without  its  effects  being  confounded  with  other  variables, 
such  as  the  various  capacities  involved  in  coding  the  content  of  the 
stimulus  materials  (e.g.,  verbal,  figural,  numerical).  In  general,  it  is 
situations  such  as  the  above  that  may  be  treated  with  models  involving 
both  manifest  and  latent  variables. 

Developing  latent  variable  models.  Consider  the  latent  structural 
model  shovn  in  Figure  4.1a.  This  model  is  essentially  the  model  of  the 
causes  and  effects  of  perceived  role  overload  presented  earlier  in  connection 
with  Figure  2.1.  However,  in  this  case  the  variables  in  the  model  are 
represented  by  perfectly  rel iable  latent  variables.  They  are  perfectly 
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reliable  latent  variables  because  they  correspond  to  hypothetical  construct 
variables  that  have  not  yet  been  operationalized  by  linking  them  to  specific 
manifest  measures.  The  four  hypothetical  constructs  or  latent  variables  are 
(1)  management's  expectations  regarding  quality  of  employees'  role 
performance  (expected  quality),  (2)  management's  expectations  regarding 
quantity  of  employees'  role  performance  (expected  quantity)  ,  (3)  employees' 
perceptions  of  role  overload  (role  overload) ,  and  (4)  employees'  state 
anxiety.  The  model  states  that  anong  employees  in  an  organization,  the  more 
their  supervisors  expect  high  quality  of  performance,  and  the  more  their 
supervisors  expect  high  quantity  of  performance,  the  more  the  employees  will 
perceive  role  overload,  which  in  turn  will  increase  the  employees'  levels  of 
state  anxiety.  The  unlabelled  disturbance  terms  assigned  to  perceptions  of 
role  overload  and  to  state  anxiety,  however,  indicate  that  there  may  be  other 
causes  of  these  variables. 


Insert  Figure  4.1  about  here 


FOr  the  sake  of  illustration,  let  us  assume  that  the  conditions  for 
the  appropriateness  of  a  theoretical  model  (Conditions  1  through  7)  are 
satisfied  and  that  this  model  is  an  accurate  representation  of  causal 
relationships.  Also  let  us  assune,  again  for  the  sake  of  illustration, 
that  the  correlation  between  the  two  latent  exogenous  variables,  expected 
quality  of  role  performance  and  expected  quantity  in  role  performance, 
is  zero. 

So  far,  we  have  stated  our  model  only  in  terms  of  latent  variables. 
It  is  not  possible  to  estimate  the  strengths  of  causal  relationships  in 
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the  latent  structural  model  because  the  latent  variables  are  unmeasured. 

To  estimate  the  strengths  of  these  causal  relations,  it  is  necessary  to 
operationalize  each  of  the  latent  variables  in  terms  of  manifest  variables 
that  are  believed  to  be  caused  by  a  latent  variable.  In  this  sense,  each 
latent  variable  has  the  role  of  a  common  factor,  and  the  manifest 
variables  serve  as  manifest  ind icators  of  the  common  factor. 

Figure  4.1b  presents  possible  manifest  indicator  variables  for  each 
of  the  latent  variables.  This  is  referred  to  as  the  measurement  model 
because  it  specifies  presumed  causal  relations  between  latent  variables 
(causes)  and  the  manifest  or  measured  variables  (effects)  that  serve  as 
indicators  of  the  latent  variables.  For  example,  four  indicators  of 
expected  quality  of  work  performance  might  be  role  demands  regarding 
(a)  reduction  in  waste  (reduction-waste),  (b)  reduction  in  number  of 
faulty  units  produced  (reduction- faulty  units)  ,  (c)  increased  number  of 
inspections  (increased  inspections) ,  and  (d)  increased  dependability  of 
product  in  tests  of  randomly  selected  units  (increased  dependability). 

The  small  £  (epsilon)  associated  with  each  manifest  indicator  of 
expected  quality  is  a  disturbance  term  that  includes  all  causes  of  the 
manifest  variable  other  than  the  latent  variable,  such  as  measurement 
error  and  nonrandom  measurement  errors. 

Multiple  indicators  are  considered  necessary  for  expected  quality 
of  work  because  either  (a)  expected  quality  can  be  manifested  in  a 
number  of  ways,  or  (b)  individual  indicators  may  be  measured  with  error, 
or  both  (a)  and  (b) .  In  contrast,  only  one  manifest  indicator  variable 
is  associated  with  the  quantity  of  work  latent  variable,  nanely,  the 
degree  to  trtiich  production  quotas  are  satisfied  (production  quotas)  . 
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Furthermore,  the  production  quotas  variable  has  no  disturbance  term. 

The  use  of  one  manifest  indicator  variable  denotes  that  the  manifest 
variable  is  regarded  as  a  perfect  indicator  of  the  latent  variable. 

This  means  that  the  indicator  is  the  primary  way  that  the  latent  variable 
is  manifested  and  that  the  indicator  contains  neither  random  nor  nonrandom 
measurement  errors.  This  appeared  reasonable  inasmuch  as  production 
quotas  are  the  key  measures  of  quantity  or  work,  are  publicly  stated, 
and  the  degree  to  which  they  are  satisfied  is  measured  objectively. 

Such  is  not  the  case  for  the  role  overload  and  state  anxiety  latent 
variables,  for  v»hich  multiple  manifest  measurements  are  often  available,  each 
of  >hich  contains  error.  Thus,  multiple  indicators  are  needed,  which  for 
role  overload  might  be  the  extent  to  which  employees  perceive  (a)  the  need  to 
v*)rk  extended  hours  (extended  hours)  ,  (b)  an  insufficient  number  of  employees 
to  accomplish  role  demands  (insufficient  personnel) ,  (c)  insufficient  time  to 
complete  tasks  (insufficient  time),  and  (d)  managerial  pressure  for 
productivity  (managerial  pressure) .  The  four  manifst  indicators  of  state 
anxiety  might  include  two  self-report  questionnaires  for  state  anxiety,  such 
as  (a)  the  Behavioral  Reactions  Questionnaire  (BRQ;  Endler  &  Magnusson, 

1977),  and  (b)  the  State-Trait  Anxiety  Inventory  (S-T  Inventory;  Spielberger, 
1977)  ,  and  two  physiological  indicators,  such  as  (c)  gastrointestinal 
distress,  and  (d)  headaches.  Each  of  the  manifest  indicators  for  role 
overload  and  state  anxiety  are  also  believed  to  be  caused  by  a  disturbance. 

When  manifest  variables  are  regarded  as  nonperfect  indicators  of  latent 
variables,  the  goodness  of  fit  of  the  measurement  model  must  be  tested.  In 
the  present  example,  this  suggests  that  the  goodness  of  fit  of  the  indicators 
for  quality  of  work,  role  overload,  and  state  anxiety  must  be  tested.  The 


test  takes  the  form  of  a  confirmatory  factor  analysis,  and  is  designed  to 
ascertain  if  manifest  indicator  variables  are  related  to  latent  variables  in 
the  manner  predicted  by  the  measurement  model.  The  test  is  discussed  later 
in  this  section;  this  discussion  includes  a  rationale  for  why  at  least  four 
manifest  indicators  are  required  for  each  latent  variable  in  order  to  conduct 
a  goodness  of  fit  test. 

Given  reasonable  satisfaction  of  the  goodness  of  fit  of  the  measurement 
model,  it  is  possible  to  proceed  to  estimate  the  strengths  of  the  causal 
connections  among  the  latent  variables  and  to  test  the  goodness  of  fit  of  the 
latent  structural  model.  Statistical  procedures  for  accomplishing  these 
tasks  are  also  explained  later.  At  this  time  we  note  only  that  the 
estimation  and  testing  processes  involve  combining  the  measurement  model  with 
the  latent  structured  model.  The  combined  models  for  this  illustration  are 
shown  in  Figure  4.2.  A  model  taking  the  form  of  Figure  4.2  is  often  referred 
to  as  a  "latent  variable  model"  or  a  "latent  variable  structural  equation 
model" . 


Insert  Figure  4.2  about  here 


Latent  variable  models  versus  manifest  variable  models;  Hie  question 
of  reliability.  A  major  drawback  of  manifest  variable  models  is  that 
they  will  yield  biased  estimates  of  structural  parameters  and  path 
coefficients  linking  latent,  hypothetical  construct  variables  when  these 
variables  are  represented  by  fallible  (not  perfectly  reliable)  manifest 
indicator  variables  in  a  manifest  variable  path  analysis.  For  example, 
consider  first  the  path  model  shown  in  Figure  4.3a.  This  is  the  latent 
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structural  model  presented  in  Figure  4.1a.  The  variables  in  the  model  are 
represented  by  perfectly  reliable  standardized  latent  variables.  They  are 
considered  standardized  because  units  of  measurement  are  arbitrary,  and  zero 
means  and  unit  variances  are  convenient. 


Insert  Figure  4.3  about  here 


Again,  for  the  sake  of  illustration,  let  us  assume  that  this  model  is 
an  accurate  representation  of  causal  relationships.  Furthermore,  let  us 
assume  that  in  the  population  of  employees  to  be  considered,  the 
(population)  path  coefficients  associated  with  the  latent  variables  are  as 
they  are  given  in  Figure  4.3a.  (Primes  denote ‘path  coefficients  linking 
latent  variables) .  Note  that  the  magnitudes  of  these  path  coefficients 
suggest  that  perceptions  of  role  overload  are  influenced  more  by  role 
expectations  regarding  quality  of  performance  (£' 31  =  .60)  than  by  role 
expectations  regarding  quantity  of  performance  (p'32  =  -50).  Again, 
let  us  assume  that  the  correlation  between  the  two  latent  exogenous 
variables  is  zero.  Finally,  we  assume  that  the  two  exogenous  variables 
do  not  relate  directly  to  the  latent  state  anxiety  variable. 

Suppose  that  for  each  latent  variable  of  the  model  we  now  choose 
one  manifest  variable  to  serve  as  an  indicator  of  the  latent  variable. 

But  suppose,  as  often  happens,  that  many  of  the  manifest  indicators 
are  not  perfectly  reliable.  Then  consider  Figure  4.3b,  which  is  a  latent 
variable  model  that  includes  both  the  structural  and  measurement  models. 
That  is,  we  have  added  four  standardized  manifest  variables  to  the  model, 
which,  with  the  exception  of  production  quotas,  are  fallible  indicators 
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of  the  respective  latent  variables.  The  three  fallible  indicators  are 
easily  identified  because  they  are  viewed  as  effects  of  a  disturbance 
designated  by  e_.  Disturbance  terms  are  regarded  as  latent  variables  and 
are  considered  here  to  be  in  standardized  form.  Moreover,  these  latent 
disturbances  are  regarded  as  consisting  exclusively  of  errors  of  measurement 
in  this  illustration.  This  means  then  that  the  measurements  (scores)  on 
each  falliable  manifest  variable  are  presaned  to  be  a  function  of  both  a 
latent  variable  and  randan  measurement  error. 

Given  standardized  variables,  the  causal  effects  of  the  latent 
variables  on  the  manifest  variables  may  be  represented  as  path 
coefficients.  In  fact,  because  we  assune  that  the  £  consist  only  of 
randan  measurement  error,  the  path  coefficients  associated  with  the  arrows 
going  from  the  latent  variables  to  their  manifest  indicators  represent 
the  square  roots  of  the  reliabilities  of  the  manifest  variables.  For 
example,  the  path  coefficient  linking  the  expected  quality  latent  variable 
to  its  manifest  indicator,  increased  dependability,  is  .81;  the 
corresponding  reliability,  shown  in  parentheses,  is  .65.  Note  that  only 
production  quotas,  the  manifest  indicator  for  the  latent  variable  of 
expected  quantity  of  role  performance,  has  a  reliability  of  1.00.  As 
before,  this  means  that  the  manifest  variable  is  a  perfect  indicator  of  the 
latent  variable.  The  reliabilities  of  the  remaining  three  manifest  variables 
are  less  than  1.00,  indicating  randan  error  in  measurement  and  the  lack  of 
perfect  correspondence  between  latent  and  manifest  variables. 

By  processes  to  be  explained  later  in  this  section,  it  is  possible  to 
solve  for  the  correlations  among  the  four  manifest  variables  using  the  path 
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coefficients  showi  in  Figure  4.3b.  These  correlations  can  then  be  used  to 
solve  for  the  path  coefficients  linking  the  manifest  variables  by  the  OLS 
procedures  discussed  in  Section  2.  The  results  of  the  path  analysis  on  the 
manifest  variables  are  reported  in  Figure  4.3c. 

It  is  now  interesting  to  compare  the  path  coefficients  in  the  manifest 
variable  model  (which  do  not  have  primes)  in  Figure  4. 3c  with  the  "correct" 
path  coefficients  of  the  latent  variable  model  in  Figure  4.3a.  The 
corresponding  values  for  these  path  coefficients  are  not  the  same.  This 
means  that  if  we  use  a  manifest  variable  approach  with  fallible  indicators 
of  latent  variables  to  estimate  the  path  coefficients  among  the  presumed 
perfectly  reliable  latent  variables,  then  the  estimates  will  be  biased.  In 
this  illustration  the  estimates  are  all  attenuated.  Moreover,  the  bias  is 
sufficient  to  lead  one  to  reverse  the  relative  importance  of  causes  on  a 
dependent  variable.  For  example,  in  Figure  4.  3c  the  path  coefficient 
relating  the  indicator  for  expected  quality  of  role  performance  (i.e., 
increased  dependability)  to  the  indicator  for  role  overload  (i.e., 
insufficient  time)  has  a  value  of  .42  (p^  =  *42) .  This  value  is  less 
than  the  path  coefficient  relating  the  indicator  for  expected  quantity  of 
role  performance  (production  quotas)  to  the  role  overload  indicator  (p^ 

=  .44)  .  This  is  just  the  reverse  of  the  "correct"  situation  in  Figure 
4.3a,  where  the  path  coefficients  are  =  *60  ^  £32'  =  *50, 
respectively. 

In  sum,  measurement  error  in  manifest  variables  may  have  serious 
consequences  for  confirmatory  analysis.  If  the  measurement  error  is 
regarded  as  nontrivial,  then  corrective  procedures  are  required. 

Corrective  options  include  the  latent  variable  form  of  analysis  to  be 


135 


described  here,  as  well  as  other  procedures  described  elsevrtiere,  such 
as  correction  for  attenuation  in  path  analysis  (cf .  Kenny,  1979)  ,  and 
instrumental  variables  and  two-stage  least  squares  (cf.  Goldberger, 

1973;  James  &  Singh,  1978) . 

Summary.  Random  measurement  error  is  not  the  only  reason  for 
considering  latent  variable  models.  In  fact,  good  reasons  exist  for  using 
a  latent  variable  approach  even  if  manifest  variables  can  be  measured  with 
a  high  degree  of  reliability.  Consider  the  points  made  earlier — that  many 
latent  variables  (hypothetical  constructs)  are  abstractions  and  can  be 
operationalized  by  any  number  of  manifest  indicator  variables.  An 
important  aspect  of  confirmatory  analysis  with  latent  variables  is  a  test 
of  the  goodness  of  fit  of  an  a  priori  measurement  model,  which  is  to  say, 
a  test  of  whether  multiple  manifest  indicators  are  related  to  latent 
variables  by  a  structure  hypothesized  by  the  investigator.  In  effect,  this 
is  a  test  of  the  construct  validity  of  the  manifest  indicators  and,  by 
implication,  of  the  adequacy  of  the  proposed  latent  variables.  Thus,  a 
latent  variable  analysis  makes  possible  a  test  of  construct  validity, 
another  key  measurement  concern.  Of  at  least  equal  importance  is  the  fact 
that  (a)  given  a  good  fit  between  a  proposed  measurement  model  and  the 
observed  data  on  manifest  indicators,  then  (b)  it  is  possible  to  estimate 
the  strengths  of  causal  relations  among  the  latent  variables.  As  in 
manifest  variable  models,  the  causal  relations  among  latent  varibles  are 
estimated  within  the  context  of  a  structural  model  relating  the  latent 
variables.  Goodness  of  fit  tests  based  on  the  logic  of  Condition  10  are 
performed  if  the  latent  structural  model  is  over identified. 

In  summary,  latent  variable  analysis  provides  the  opportunities 
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(a)  to  work  with  perfectly  reliable  causes  and  effects,  (b)  to  test  the 
goodness  of  fit  of  a  presumed  measurement  model,  and  (c)  to  test  the 
goodness  of  fit  of  a  structural  model  relating  latent  variables  both  in 
terms  of  estimating  causal  parameters  and  in  terms  of  testing 
overidentifying  restrictions. 

One  does  not  partake  of  the  benefits  of  a  latent  variable  approach 
without  paying  a  heavy  price.  The  lesser  part  of  the  price  is  that  one 
must  conquer  a  complex  and  often  cumbersome  terminology.  More  demanding 
is  a  requirement  that  one  have  a  methodological  sophistication  that 
exceeds  even  that  required  in  the  most  complex  multivariate  course 
generally  offered  in  psychology.  However,  we  believe  the  price  is 
worthwhile  for  those  desiring  the  benefits.  In  the  discussion  to  follow, 
we  have  tried  to  ease  the  pain  by  focusing  on  the  logic  of  the  approach 
and  merely  overviewing  many  of  the  statistical  equations.  However,  we 
believe  that  it  is  important  to  maintain  fidelity  with  what  truthfully  is 
a  very  complex  procedure.  It  is  also  noteworthy  that  the  pitfalls  of  the 
latent  variable  procedures  are  not  as  well-known  as  those  associated  with 
the  manifest  variable  procedures,  which  is  due  to  the  fact  that  the  latent 
variable  approaches  were  developed  much  later  than  the  manifest  variable 
approaches.  Finally,  our  attention  in  this  section  is  focused  on  the 
operational  aspects  of  latent  variable  analysis.  However,  as  reviewed  at 
the  end  of  this  section,  the  seven  conditions  pertaining  to  the 
appropriateness  of  theoretical  models  for  confirmatory  analysis  are  every 
bit  as  important  for  latent  variable  models  as  they  are  for  manifest 


variable  models 


137 


Representation  of  Structural  Models  with  Latent  Variables. 

Notation.  With  the  need  now  to  distinguish  between  manifest  and 
latent  variables  in  our  discussion,  we  need  a  new  notation.  The  notation 
we  will  use,  with  minor  modifications  on  our  part,  was  suggested  by  Bentler 
and  Weeks  (1980).  In  Figure  4.4  we  show  the  hypothetical  structural  model 
of  Figure  4.3a  in  the  new  notation.  In  this  notation  latent  exogenous  and 
latent  endogenous  variables  are  represented  in  structural  diagrams  by 
letters  enclosed  in  circles  or  ellipses.  Latent  exogenous  variables  are 
designated  by  the  Greek  letter  E  (xi)  with  subscripts,  while  latent 
endogenous  variables  are  designated  by  the  Greek  letter  (eta)  with 
subscripts.  Disturbance  terms  or  variables  are  always  treated  as  latent 
variables  and  are  identified  by  the  Greek  letter  e.  (epsilon)  with 
subscripts  but  without  enclosing  circles  or  ellipses. 


Insert  Figure  4.5  about  here 


As  in  structural  diagrams  with  manifest  variables,  causal  connections 
between  variables  are  indicated  by  arrows,  with  each  arrow  pointing  from  a 
causal  variable  to  an  effect  variable.  The  lack  of  an  arrow  between  two 
variables  indicates  the  lack  of  a  direct  causal  connection  between  these 
variables.  Associated  with  each  arrow  is  a  structural  parameter  that 
indicates  the  nunber  of  units  of  change  in  the  effect  variable  resulting 
from  a  unit  change  in  the  causal  variable,  holding  all  other  causes  of  the 
effect  constant.  A  structural  parameter  corresponding  to  a  missing  arrow 
is  regarded  as  fixed  and  equal  to  zero.  The  structural  parameter 
associated  with  an  arrow  from  an  exogenous  variable  to  an  endogenous 


variable  is  designated  by  the  Greek  letter  y  (gamma)  with  subscripts,  the 
left-most  subscript  indicating  the  number  of  the  endogenous  variable  and 
the  right-most  subscript  indicating  the  number  of  the  exogenous  variable. 
The  structural  parameter  associated  with  an  arrow  frcm  one  endogenous 
variable  to  another  endogenous  variable  is  indicated  by  the  Greek  letter  a 
(alpha)  with  subscripts,  with  the  left-most  subscript  the  number  of  the 
endogenous  variable  that  is  the  effect  variable  and  the  right-most 
subscript  the  number  of  the  endogenous  variable  that  is  the  causal 
variable.  The  structural  parameter  associated  with  an  arrow  from  a 
disturbance  variable  to  an  endogenous  variable  is  designated  by  the  Greek 
letter  6  (delta)  with  subscripts.  TVo-headed  arrows  connecting  pairs  of 
exogenous  variables  or  disturbance  variables  indicate  nonzero  covariance 
relations  between  these  variables. 

In  Figure  4.4  and  £2  are  latent  exogenous  variables.  Both  are 
causes  of  latent  endogenous  variable  n  but  not  of  latent  endogenous 
variable  B2»  The  two-headed  arrow  between  latent  exogenous  variables 
and  C2  indicates  a  presumed  nonzero  covariance  between  them.  Latent 
endogenous  variable  3)^  is  in  turn  a  cause  of  latent  endogenous  variable 
J)2.  Variables  and  f2  are  disturbance  variables,  each  associated  with 
its  respective  endogenous  variable.  The  disturbance  variables  are  presumed 
to  be  uncorrelated. 

A  model  with  all  latent  variables  in  it,  like  the  model  in  Figure 
4.4.  cannot  be  tested  empirically  until  it  is  operationalized  by  linking 
the  latent  variables  to  manifest  variables  in  an  appropriate  way.  The 
linkage  of  a  latent  variable  to  manifest  variables  in  a  model  is  usually 
accomplished  by  including  at  least  four  manifest  indicator  variables  that 
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are  presumed  to  be  effects  of  (i.e.,  caused  by)  the  latent  variable.  The 
reason  four  manifest  indicator  variables  are  required  is  so  that  the  latent 
variable  and  the  structural  parameters  indicating  the  degree  of  its  effect 
on  the  indicator  variables  will  be  overidentified.  This  allows  one  not 
only  to  estimate  uniquely  the  structural  parameters  but  also  to  test  the 
presumption  that  the  manifest  indicator  variables  have  a  latent  variable  as 
a  common  causal  variable.  Such  a  situation  is  illustrated  in  Figure  4.5, 
which  shows  four  indicator  variables  each  an  effect  of 

latent  exogenous  variable  and  one  of  the  mutually  uncorrelated 
disturbance  variables  €^,...,  e^.  Figure  4.5  also  serves  to  introduce 
notation  for  manifest  variables  in  structural  models  with  both  manifest  and 
latent  variables.  Notice  that  variables  ^,...,^4  are  enclosed  in 
boxes.  This  means  they  are  manifest  variables.  They  are  furthermore 
designated  by  the  letter  which  indicates  that  they  are  manifest 
endogenous  variables.  The  Greek  letter  Y  (gamma)  is  again  used  to 
indicate  a  structural  coefficient  of  a  causal  relation  between  an  exogenous 
variable  and  an  endogenous  variable.  Moreover,  the  Greek  letter  &  (delta) 
is  again  used  to  indicate  the  structural  coefficient  of  the  effect  of  a 
disturbance  variable  on  an  endogenous  variable. 


Insert  Figure  4.5  about  Here 


Our  notation  also  permits  us  to  have  manifest  exogenous  variables.  As 
shown  later,  these  will  be  designated  by  the  letter  x  enclosed  in  a  box. 
However,  manifest  variables  selected  as  indicato rs  of  a  latent  exogenous 
variable  are  ordinarily  endogenous  and  not  exogenous  variables.  The  reason 
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this  is  so  is  because  the  manifest  variables  chosen  as  indicators  of  a 
latent  exogenous  variable  are  usually  effects  of  not  only  the  latent 
exogenous  variable  in  question  but  of  extraneous  variables  as  well,  which 
are  subsmed  under  the  respective  disturbance  variables  attached  to  the 
manifest  indicator  variables.  Not  the  least  important  of  these  extraneous 
variables  will  be  errors  of  measurement.  Furthermore,  the  effects  of  a 
latent  variable  on  its  manifest  indicator  variables  may  be  measured  in 
different  units  of  measurement,  which  would  be  reflected  in  different 
values  for  the  structural  parameters  relating  the  latent  variable  to  its 
respective  manifest  indicator  variables. 

However,  before  we  proceed,  we  must  consider  how  we  would  treat  an  ideal 
situation  in  which  we  had  more  than  one  presumably  "pure",  disturbance-free 
manifest  indicator  variable  of  a  latent  variable.  Figure  4.6  illustrates 
just  such  a  model.  Variables  y^,...,y^  are  disturbance-free  manifest 
indicators  of  latent  exogenous  variable  Ij^.  By  "disturbance- free"  in  this 
case  we  mean  that  each  indicator  variable  has  only  the  latent  variable  ^ 
as  a  cause.  There  are  no  extraneous  causes  in  addition  to  the  latent 
variable  In  this  example  the  indicator  variables  are  perfectly 

correlated  with  the  latent  exogenous  variable  (and  with  one  another).  They 
differ  among  themselves  in  having  different  values  for  the  structural 
parameters  relating  them  to  the  latent  exogenous  variable  because,  say,  they 
are  measured  vrith  different  units  of  measurement. 


Insert  Figure  4.6  about  Here 


When  we  have  more  than  one  disturbance- free  indicator  of  a  latent 
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variable,  then  we  should  discard  all  but  one  of  these  for  use  in  a  model. 
This  will  avoid  the  problem  of  singularity  for  the  variance-covariance 
matrix  of  the  manifest  variables  that  arises  from  having  several  variables 
that  are  simple  linear  combinations  of  other  variables  in  the  analysis.  (A 
singular  or  noninvertable  sample  variance-covariance  matrix  for  the 
manifest  variables  would  preclude  obtaining  estimates  of  the  structural 
parameters  of  the  model) .  The  indicator  variable  retained  and  the  latent 
variable  it  "indicates"  can  then  be  treated  as  simply  identical.  That  is, 
we  can  replace  the  latent  variable  by  the  manifest  variable  in  the  model. 

If  the  latent  variable  replaced  by  a  "disturbance-free"  manifest  indicator 
has  a  disturbance  term,  because  the  latent  variable  is  an  endogenous 
variable  as  well,  then  the  manifest  indicator  acquires  the  disturbance  term 
of  the  original  latent  variable.  This  disturbance  is  not  the  disturbance 
of  which  the  manifest  indicator  is  "disturbance- free" .  Saying  that  the 
manifest  indicator  is  a  disturbance- free  indicator  simply  means  that 
relative  to  the  latent  variable  as  a  cause  of  the  indicator,  there  are  no 
other  extraneous  causes.  This  says  nothing  about  the  extraneous  causes  of 
the  latent  variable  that  the  indicator  may  replace. 

Thus,  given  these  notational  considerations,  we  might  have  a  model 
like  that  in  Figure  4.7.  In  this  model  we  have  one  latent  exogenous 
variable,  linked  to  four  manifest  variables,  each 

of  which  is  also  linked  to  one  of  four  disturbance  variables 
6,..  We  also  have  a  single  manifest  exogenous  variable  x2*  Variables 
^  and  are  also  causes  of  a  manifest  endogenous  variable,  y^,  to 
which  is  also  drawn  a  disturbance  variable  representing  a  component  of 
^  not  caused  by  the  exogenous  variables.  Note  that  with  this  exanple 
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we  have  introduced  a  manifest  exogenous  variable  and  denoted  it  by  the 
subscripted  letter  x  contained  within  a  box.  Also  we  have  used  the 
Greek  letter  Y  (gamma)  to  denote  structural  parameters  relating  exogenous 
variables  to  endogenous  variables,  while  8  (delta)  denotes  the  structural 
parameter  relating  a  disturbance  variable  to  an  endogenous  variable. 


Insert  Figure  4.7  about  Here 


A  situation  in  which  one  has  four  exogenous  manifest  variables  regarded 
as  causes  of  a  single  latent  variable,  which  we  illustrate  by  a  model  in 
Figure  4.8,  should  not  be  confused  with  a  situation  in  which  one  has  four 
manifest  indicator  variables,  each  an  effect  of  a  common  latent  variable, 
such  as  illustrated  in  Figure  4.7.  In  Figure  4.8  we  presume  that  the 
manifest  variables  x^,...,^  are  perfectly  reliable  but  not  strongly 
correlated,  exogenous  variables.  Each  of  these  manifest  exogenous 
variables  is  a  distinct  cause  of  latent  variable  y\ which  also  is  an 
effect  of  a  latent  disturbance  variable  .  Latent  variable  ^  is  in 
turn  a  common  cause  of  manifest  indicator  variables  eac^ 

with  its  respective  disturbance  variable  Again  Y  refers  to 

structured  parameters  relating  exogenous  to  endogenous  variables,  <*■ 
to  structural  parameters  relating  endogenous  to  endogenous  variables,  and 
8  to  structural  parameters  relating  disturbance  terms  to  endogenous  variables. 


Insert  Figure  4.8  about  Here 


Note  also  that  in  these  models  we  have  numbered  variables  in  the 
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following  way:  Exogenous  variables  are  grouped  together  and  latent 
exogenous  variables  are  numbered  first,  before  manifest  exogenous 
variables  are  numbered.  Similarly,  endogenous  variables  are  grouped 
together  and  latent  endogenous  variables  are  numbered  before  manifest 
endogenous  variables.  Disturbance  variables  receive  the  number  of  the 
corresponding  endogenous  variable. 

Structural  equations.  Consider  the  structural  model  in  Figure  4.9. 
This  model  represents  an  operationalization  of  the  latent  construct 
variables  of  the  model  in  Figures  4.3a  and  4.4.  Note  that  the  model 
contains  both  latent  and  manifest  exogenous  variables  as  well  as  latent  and 
manifest  endogenous  variables.  £-^  is  a  latent  exogenous  variable 
operationalized  by  four  manifest  endogenous  indicator  variables 
Y11, . . .  *2  a  man^est  exogenous  variable.  _§]_  and 

*2  are  both  causes  of  a  latent  endogenous  variable  as  indicated 
by  the  arrows  from  ^  and  x2  to  Four  manifest  endogenous 

variables  serve  as  indicators  of  as  indicated  by 

the  arrows  from  ^  to  these  variables.  ^  is  also  a  cause  of  latent 
endogenous  variable  H2,  which  in  turn  serves  as  a  common  cause  of 
manifest  endogenous  indicator  variables  %-j , . . . ,^Q.  Further  note 
that  each  endogenous  variable  in  this  model  has  one  of  the  latent 
disturbance  variables  6^,...,£^4  associated  with  it.  Also  observe  that 
there  are  no  double-headed  arrows  between  and  x2,  indicating  that 
these  exogenous  variables  are  uncorrelated,  while  there  are  also  no 
double-headed  arrows  among  any  of  the  disturbance  variables,  indicating 
that  they  too  are  mutually  uncorrelated. 

As  in  the  case  with  models  with  manifest  variables,  we  may  express  the 
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model  in  Figure  4.9  by  a  system  of  simultaneous  equations.  One  equation  is 
developed  for  each  latent  or  manifest  endogenous  variable,  which  means  we 
shall  have  14  equations.  Each  equation  includes  the  latent  and/or  manifest 
variables  that  have  a  direct  effect  on  the  endogenous  variable,  including 
disturbance  variables.  This  system  of  equations  follows: 
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(4.1) 


Matrix  Equations.  The  model  in  Figure  4.9  is  a  relatively  simple 
linear  structured  model  with  latent  variables.  But  latent  variable  models 
can  become  much  more  complex.  For  instance,  a  model  may  involve  many 
latent  causal  and  effect  variables,  each  of  which  is  represented  by  a 
number  of  manifest  indicator  variables.  In  other  latent  variable  models, 
the  exogenous  variables  may  be  intercorrelated .  Disturbance  variables  may 
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also  be  mutually  intercorrelated.  (Disturbance  variables  must  not  be 
correlated  with  exogenous  variables) .  Ordinary  algebraic  notation  thus 
becomes  extremely  cumbersome  to  deal  with  in  structural  models  with  latent 
variables.  And  so,  latent  variable  models  are  usually  described  with 
matrix  algebraic  equations. 

There  have  been  a  number  of  notational  systems  proposed  for 
representing  structural  equation  models  by  equations  in  matrix  algebra.  A 
very  popular  system  of  matrix  notation  is  used  in  connection  with  the 
LISREL  computer  program  of  Jflreskog  and  Sflrbcm  (1978) .  This  notation 
distinguishes  between  a  structural  equation  model  and  a  measurement  model. 
Distinguishing  between  these  two  models  is  based  on  the  assumption  that  all 
structural  equation  models  are  ultimately  hypothetical  models  involving 
latent  variables.  But  to  confirm  a  structural  equation  model  we  must  link 
the  hypothetical  structural  model  to  observed  variables  by  constructing, 
for  each  hypothetical  latent  variable  of  the  structural  model,  a  number  of 
manifest  variables  that  measure  effects  of  the  latent  variable.  The  latent 
variables  of  the  structural  model  are  then  regarded  as  common  factors  of 
the  manifest  indicator  variables.  The  measurement  model  thus  refers  to  the 
structure  expressed  as  a  confirmatory  factor  analytic  model  of  the 
relationship  between  the  hypothetical  latent  variables  and  their  manifest 
indicator  variables.  However,  a  quirk  of  the  LISREL  notation  is  that  no 
manifest  indicator  variable  may  be  an  effect  of  both  a  latent  exogenous  and 
a  latent  endogenous  variable  at  the  same  time.  Furthermore,  the  distinction 
in  the  LISREL  notation  between  the  structural  parameters  of  the  structural 
equation  model  and  the  "factor  loadings"  of  the  measurement  model 
introduces  unnecessary  complications  in  the  algorithm  and  the  computer  code 
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needed  to  implement  the  estimation  of  these  parameters. 

In  addition  to  the  LISREL  notation,  a  nunber  of  other  notations  have 
been  proposed,  apparently  in  an  attempt  to  find  a  general  model  that 
contains  all  structural  models — including  structural  equation  models, 
analysis  of  covariance  structure  models,  and  common  factor  analysis 
models — as  special  cases.  Of  note  are  McDonald's  (1978,  1980)  COSAN 
(Covariance  Structure  Analysis)  model  and  associated  program,  and  Bentler's 
(1979)  model.  McArdle  (1979,  1980)  proposed  that  all  these  models 
and  their  notations  are  special  cases  of  each  other.  He  suggested  that 
these  models  could  all  be  seen  as  special  cases  of  a  very  simple  but 
general  model  in  vhich  there  is  no  distinction  notationally  between  a 
structural  model  and  a  measurement  model  nor  a  need  for  the  multiple  levels 
of  nested  orders  of  factors,  as  had  been  proposed  by  McDonald  (1978)  or 
Bentler  (1976)  .  Bentler  and  Weeks  (1980)  proposed  a  matrix  notation  for  a 
general  structural  equation  model  that  may  be  seen  as  a  specialization  of 
McArdle' s  (1979).  We  shall  use  the  Bentler  and  Weeks  (1980)  matrix 
notation,  with  minor  modifications,  in  the  present  discussion. 

According  to  Bentler  and  Weeks  (1980) ,  there  are  in  structural  equation 
models  two  basic  kinds  of  variables,  independent  and  dependent 
variables.  In  structural  diagrams  dependent  variables  are  those  variables 
to  which  arrows  point  from  other  variables.  Independent  variables  do  not 
have  causal  arrows  pointing  to  them.  Now,  an  exanination  of  any  structural 
diagram  such  as  Figure  4.9  will  show  that  both  latent  and  manifest 
exogenous  variables  ( $  and  x)  as  well  as  latent  disturbance  variables  (6) 
have  no  causal  arrows  pointing  to  them.  They  are  independent  variables. 

On  the  other  hand,  latent  and  manifest  endogenous  variables  (h  and  y)  have 


arrows  pointing  to  them  and  are  therefore  dependent  variables.  We  will  now 
see  how  this  distinction  between  independent  and  dependent  variables  is 
made  in  a  matrix  notation  for  structural  equation  models  with  manifest  and 
latent  variables. 

The  "expanded"  matrix  equation  representing  the  simple  16  variable 
model  in  Figure  4.9  is  given  in  Table  4.1.  Dependent  variables  are 
represented  in  a  randan  vector  r\*  (eta  star)  that  may  be  partitioned  as 
U'  =  pV  ,  y']»  where  h'  is  a  (transposed)  randan  subvector  of  latent 
dependent  variables  and  is  a  (transposed)  randan  subvector  of 
manifest  dependent  variables.  (By  transposed  is  meant  that  the  array  has 
been  rearranged  so  -that  rows  becane  colunns  and  columns  becane  rows.  For  a 
single  colunn  vector,  a  one  dimensional  array,  transposing  this  vector 
means  the  colunn  becanes  a  row  of  elements — which  we  do  here  to  make  it 
convenient  to  display  the  vector  in  the  text) .  The  nunber  of  latent 
dependent  variables  in  *1'  is  indicated  by  m^;  the  nunber  of  manifest 
dependent  variables  in  y'  is  indicated  by  m_.  The  total  number  of 

■w  ^ 

dependent  variables  is  indicated  by  m,  where  m  =  +  m^.  The  order 

of  rj*  is  thus  m  x  1. 


Insert  Table  4.1  about  Here 


Independent  variables  are  all  included  in  a  single  random  vector  Z* 
(xi  star) .  This  vector  may  be  partitioned  to  distinguish  between  manifest 
and  latent  exogenous  variables  and  disturbance  variables.  Thus  we  may 
write  <*'  =  [5*  ,x'  ,V  ],  where  5*  stands  for  a  (transposed)  subvector  of 
latent  exogenous  variables,  x'  stands  for  a  (transposed)  subvector  of 
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manifest  exogenous  variables,  and  £_'  stands  for  a  (transposed)  subvector 

of  latent  disturbance  random  variables.  The  number  of  latent  exogenous 

variables  included  in  5'  is  n^;  the  number  of  manifest  exogenous 

variables  in  x  is  n_;  and  the  number  of  disturbance  variables  in  £'  is 

equal  to  m  (the  number  of  dependent  variables) .  The  number  of 

independent  variables  in  E*1  is  thus  n^  +  n2  +  ID  =  H •  an^  so  the 

order  of  E*  is  n  x  1. 

— 

The  structural  parameters  tt.  ^  that  relate  dependent  to  dependent 
variables  are  included  in  the  square  m  x  m  matrix  A  (alpha) .  Each  row 
of  A  corresponds  to  one  of  the  dependent  variables  and  contains  the 
structural  parameters  of  those  other  dependent  variables  that  are  causes  of 
that  dependent  variable.  The  structural  parameter  in  the  ^th  row  and 
jth  column  of  A  thus  represents  the  amount  of  change  in  the  j^th 
variable  that  results  from  a  unit  of  change  in  the  jth  dependent  variable. 
If  the  element  in  the  j_th  row  and  jth  column  is  a  zero,  this  means  that 
the  jth  dependent  variable  is  not  a  cause  of  the  _ith  dependent 
variable.  The  elements  of  the  diagonal  of  A, are  thus  ordinarily  zero, 
meaning  that  a  dependent  variable  does  not  cause  itself.  In  recursive 
models  the  order  of  the  dependent  variables  can  be  arranged  so  that  the 
A  matrix  is  lower  triangular,  that  is,  nothing  but  zeroes  are  above  the 

•A* 

diagonal  of  A,  implying  that  a  dependent  variable  only  causes  other 
dependent  variables  beyond  it  in  their  relative  order.  In  nonrecursive 
models  such  as  those  involving  reciprocal  causation,  the  A  matrix  will 
have  nonzero  but  distinct  symmetric-counterpart  elements  both  below  and 
above  the  diagonal. 

The  structural  parameters  that  relate  independent  to  dependent 


t 


149 

variables  are  contained  in  the  matrix  r*  (gamma  star) .  The  matrix  r*  = 
[r:A]  is  partitioned  into  the  m  x  (n.  +  n_)  matrix  r  and  the 

~~~  -L  2  *** 

m  x  m  matrix  &  (delta) .  The  rows  of  r  correspond  to  the  different 
dependent  variables.  The  columns  of  £  correspond  to  the  different 
exogenous  variables.  A  zero  element  in  the  _ith  row  and  Juth  column  of  r 
means  that  the  kth  exogenous  variable  is  not  a  cause  of  the  ith 
variable.  In  the  present  example  r  is  a  14  x  2  submatrix.  a_  contains  the 
structural  parameters  relating  the  dependent  variables  to  their  respective 
disturbance  variables.  The  rows  of  A  thus  also  correspond  to  the  different 
dependent  variables  while  the  columns  of  A,  correspond  to  the  different 
disturbance  variables.  Normally  A  is  a  diagonal  matrix  (sometimes  even  an 
identity  matrix)  with  zeros  everywhere  except  in  the  principal  diagonal. 
That  A  is  a  diagonal  matrix  reflects  the  fact  that  each  dependent  variable 
is  directly  associated  with  a  different  disturbance  variable.  Thus  is 
the  structural  parameter  relating  the  ^Lth  variable  to  its  corresponding 
disturbance  variable.  In  the  present  example  A  is  a  14  x  14  matrix. 

Insert  Tfcble  4.2  about  Here 


♦ 


We  have  summarized  the  notation  we  will  use  in  Table  4.2.  Let  us  now 


turn  to  a  more  compact  form  for  the  general  matrix  formulation  of  the 
linear  structural  equation  model  with  latent  and  manifest  variables.  This 
form  is  given  in  the  general  case  by: 


[r  :  A  ] 


or  more  simply  by 


(4.2a) 
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in*  =  An»  +  r*C*  (4.2b)  . 

The  dependent  variables  in  are  found  on  both  sides  of  equations 
4.2a  and  4.2b.  This  is  not  the  usual  form  in  which  a  structural  equation 
is  written.  Wiley  (1973)  recommends  the  following:  Bring  the  expression 
containing  the  matrix  A  from  the  right  side  over  to  the  left  side  of  the 
equation  and  then  factor  >1*from  the  resulting  expression  to  obtain 
(X_  —  A)  =  r*  5**  we  then  define  B  =  (I  -  A) ,  we  may  write  the 
structural  equation  in  the  canonical  form  more  like  that  found  in 
econometric  textbooks 

B3*  =  r*£*  (4.3). 

Note  that  the  diagonal  elements  of  E3  (beta)  must  all  be  unity.  The 
nonzero  off-diagonal  elements  of  B,will  be  the  negative  values  of  the 
nonzero  off-diagonal  elements  of  A.  This  means  that  in  interpreting  the 
nature  of  the  causal  effects  one  will  have  to  mentally  change  the  signs  of 
the  beta  coefficients  in  the  matrix  B.  A  common  mistake  is  to  fail  to  make 
these  changes. 

If  we  prenultiply  both  sides  of  e  ^uation  4.3  by  IB  1  we  obtain  the 
"reduced  form"  of  the  structural  model  equation.  The  reduced- form  equation 
gives  dependent  variables  totally  as  functions  of  independent  variables. 

The  reduced  form  for  equation  4.3  is: 

rp  =  (4.4). 

The  goal  of  structural  equation  models  is  to  show  how  relationships 
among  manifest  variables  (given  by  either  correlations  or  covariances)  can 
be  explained  in  terms  of  the  structural  equations  relating  the  manifest 
variables  to  other  (possibly  latent)  variables  of  the  model,  lb  reach  this 
goal  we  will  need  certain  "selection"  equations  that  "select"  the  manifest 
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variables  in  the  subvectors  y  and  x,  respectively,  from  the  larger  vectors, 
and  £*,  of  variables  (Bentler  &  Weeks,  1980).  Oonsider  the  following 
selection  equations: 


y  =  [0:I]rn"|and  x  =  [0:1:0] 


>:iinr 

_y 


or 


(4.5a) 


y  =  Gyi|3*  and  x=£x£*  (4.5b) 

Gy  =  [0:1]  is  a  partitioned  (n^  x  m)  "selection"  matrix  with  0  an 


nt,  x  null  matrix  and  an  m,,  x  m^  identity  matrix.  In  other 


words,  Gy  contains  zero  elements  everywhere  except  for  a  single  elanent  of 


unity  in  each  row  placed  in  the  appropriate  column  of  Gy  to  "select"  a 


corresponding  manifest  dependent  v  variable  from  it*.  Similarly,  G„  = 
[0:1:0]  is  an  (n_  x  n)  selection  matrix  with  the  first  0  on  the  left  an 

**  <«>  —4  —  — 

n_  x  n.  null  matrix,  I  an  n_  x  n_  identity  matrix,  and  the  0  on 
the  right  an  n_  x  m  null  matrix.  In  other  words,  G  contains  zero 
elements  everywhere  except  for  a  single  element  of  unity  in  each  row  placed 
in  the  appropriate  colunn  of  £x  to  select  a  corresponding  manifest 
independent  x  variable  from  5*.  Tb  illustrate,  for  the  model  in  Figure  4.9, 


£y  is  a  12  x  14  matrix  with  0's  for  every  element  except  for  a  single  1  in 


each  row  in  the  column  corresponding  to  a  manifest  endogenous  variable  ^ 

in  tj*  .  Similarly,  G  is  a  1  x  16  matrix  [0  10000000000000  0] 
~x 


with  unity  corresponding  to  the  manifest  exogenous  variable  in  the 


vector  **■  =  [£1,x2,&  1  • 

We  also  need  to  define  a  matrix  reflecting  the  variances  and 
covariances  among  the  independent  variables  of  the  model.  As  explained 
above  all  independent  variables  are  included  in  the  random  vectors*.  Thus, 
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the  matrix  of  variances  and  covariances  among  the  independent  variables  is 
given  by  the  matrix 


♦  =  E(§*|*')  (4.6) 

where  4>  is  the  Greek  letter  phi.  The  model  requires  that  exogenous 
variables  are  independent  of  disturbance  variables  (i.e.,  the  model  is 
self-contained) .  This  requirement  is  expressed  mathematically  by  the 
requirement  that  E{  £  [?'  :x']J  =  0,  where  E{  )  is  the  expectation  operator. 
The  effect  of  this  requirement  appears  in  the  matrix  and  may  be  seen  in  a 
partitioning  of  this  matrix  as 


<t>  = 


ht.  h*  2 

Ix5  Jxx  2 
0.  0  £ee 


(4.7) 


where  =  E(€C  )  ,  9  -E^x1),  4>xx  =  E  (xx* )  and  *EE  =  E  (ee' )  . 

The  following  terms  are  set  equal  to  zero  in  Bq.  4.7:  $££  ,  4  , 

and  lex- 

From  the  reduced- form  Bq.  4.4,  the  selection  equations  Bq.  4.5  and 
Bq.  4.6,  we  are  able  bo  derive  the  variance-covariance  matrix 
contains  the  "predicted"  variances  and  covariances  among  the  manifest 
variables,  where  by  "predicted"  is  meant  that  the  variances  and  covariances 
take  on  the  values  they  would  have  if  they  were  generated  as  functions  of 
the  structural  parameters  of  a  hypothetical  structural  equation  model.  The 
matrix  equations  which  display  E_  as  functions  of  structural  parameters 
and  covariances  among  independent  variables  are  as  follows: 


£yy  lyx 


[Sxy  £xx. 


(4.8) 
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where,  according  to  the  model, 

£yv  =  }  =  y’  7 


-yy 

E  =  E (xy‘ )  =  G  $r*’B_1,G  1  , 
-xv  —  7  -x —  -  -v 


(4.9a) 

*xy  7  -x—  -  -y  '  (4.9b) 

Exx  =  E(xx')  =  G*Gx’  .  (4.9c) 

The  implications  of  Bq.  4.8  and  Bqs.  4.9a,b,c  are  that  a  predicted 
or  hypothetical  variance  covariance  matrix  Sq  for  the  set  of  observed 
variables  in  random  vectors  x  and  y  may  be  derived  from  the  parameter 
values  of  a  hypothetical  structural  equation  model.  Therefore,  the  degree 
to  which  the  hypothetical  structural  equation  model  reflects  reality  is 
given  by  the  degree  to  vhich  the  hypothetical  matrix  is  the  same  as 
the  empirical  variance-covariance  matrix  £  for  the  same  variables  (in  x  and 
y)  obtained  from  measurements  of  these  variables  in  the  world,  lb  make 
this  comparison  between  the  hypothetical  matrix  and  the  empirical 
matrix  £  is  the  goal  of  a  confirmatory  analysis  using  structural  equation 
models  with  latent  and  manifest  variables. 


Specifying  a  Structural  Model 

One  can  obtain  different  models  for  the  same  set  of  manifest  and 
latent  variables  in  g*  and  h*  by  assigning  different  values  to  the 
respective  elements  of  the  matrices  B,  T* ,  and  <p,  known  as  "paraneter 
matrices",  in  the  structural  equation  model.  In  other  words,  to  define  a 
structural  equation  model  with  observed  and  latent  variables  one  must 
specify  the  nature  of  the  elements  of  the  parameter  matrices  of  the  model. 
Any  particular  element  of  these  matrices  may  be  specified  [to  use 
terminology  popularized  by  JSreskog  (1973)]  in  one  of  three  ways: 

(1)  As  a  fixed  parameter,  given  a  value  specified  prior  to  the 
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analysis  tfiich  does  not  change  throughout  the  estimation  process. 

(2)  As  a  free  parameter ,  whose  value  is  to  be  estimated  from  the 
data  conditional  on  the  values  of  fixed  and  constrained  parameters 
in  the  model. 

(3)  As  a  constrained  parameter,  whose  value  is  yoked  together  with 
certain  other  parameters  that  are  all  constrained  to  equal  the 
same  value.  This  value  may  be  free  to  be  estimated  conditional 
on  the  values  of  fixed  and  other  sets  of  constrained  parameters. 

Specification  of  a  structural  model  is  tied  intrinsically  to  the  question 
of  identification.  Consequently,  we  shall  discuss  identification  and  then 
illustrate  its  application  in  regard  to  the  model  in  Figure  4.9. 

Identification.  It  is  essential  to  specify  a  structural  equation 
model  in  such  a  way  that  the  model  is  identified.  Whether  a  model  is 
identified  concerns,  in  the  broadest  sense,  distinguishing  one  theory  from 
another  theory  when  operationalizing  theory  with  data  (L'Esperance,  1972). 
With  structural  equation  models,  identifying  a  model  involves  specifying  a 
pr iori,  according  to  theory,  the  fixed  values  of  certain  parameters  of  the 
model  so  that  (a)  no  other  theoretical  model  (except  one  that  differs 
trivially  from  it  in  terms  of  different  units  of  measurement)  can  have  the 
same  fixed  parameter  values,  but  different  free  parameter  values,  and  (b) 
still  generate  the  same  hypothetical  covariance  matrix  for  the 
observed  variables.  In  other  words,  identifying  a  structural  model  means 
fixing  the  values  of  an  appropriate  set  of  parameters  in  the  model  so  that 
the  hypothetical  covariance  matrix  associated  with  this  set  of  fixed 
parameters  will  be  unique  for  each  distinct  complementary  set  of  free 
parameter  values. 
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Identification  also  concerns  fixing  enough  parameters  of  the 
parameter  matrices  B,r*,  and  $  so  that  the  remaining  parameters  can  then 
be  solved  for  in  terms  of  the  values  of  the  covariances  among  the  manifest 
variables  and  the  values  of  the  fixed  parameters.  If  an  insufficient 
number  and/or  inappropriate  set  of  parameters  are  fixed  a  priori,  the 
model  may  not  be  identified.  Then  there  may  be  no  consistent  solutions  for 
the  free  parameters  of  the  model.  In  tl  s  regard  Jflreskog  (1979)  has 
indicated  the  following  guidelines  for  evaluating  the  identification  of 
one's  model:  Let  s  be  the  number  of  free  and  constrained  parameters 
(each  distinct  constrained  paraneter  being  counted  only  once)  in  the 
matrices  B,  T*  and  $.  Then  a  necessary  (but  not  sufficient)  condition  that 
all  of  the  free  and  constrained  parameters  of  the  model  are  identified  is 
that 

s  <  (n^  +  n2)  (£2  +  H2  +  1)/2 

where  the  expression  on  the  right  is  the  number  of  independent  elements  of 
the  hypothetical  variance-covariance  matrix  2L  for  the  full  set  of 
observed  variables,  with  n 2  x  variables  arvd  m2  y  variables. 

To  understand  the  way  identification  of  a  free  or  constrained 
parameter  may  be  determined  by  some  elements  of  the  hypothetical  covariance 
matrix  Zq  and  the  values  of  the  fixed  parameters,  let  us  return  to 
EJqs.  4.9a  through  4.9c.  These  equations  imply  that  the  elements  of  the 
variance-covariance  matrix  are  functions  of  the  values  of  the  fixed 
and  free  parameters  in  the  parameter  matrices  of  the  model.  If  one  can 
find  a  way  to  9olve  uniquely  for  a  given  free  or  constrained  parameter  from 
elements  of  the  variance-covariance  matrix  and  the  values  of  the  fixed 
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parameters,  then  the  free  or  constrained  parameter  is  identified.  If 
there  is  only  one  way  of  using  elements  of  2^  to  solve  for  a  given  free 
or  constrained  parameter,  then  the  parameter  is  just  identified.  If 
there  is  more  than  one  way  (using  different  elements  of  IQ)  to  solve  for 
the  parameter,  then  the  parameter  is  over  identified .  If  all  free  and 
constrained  parameters  are  just  identified,  then  the  model  is  just 
identified.  If  in  addition  some,  but  not  all,  of  the  free  and  constrained 
parameters  are  over identified,  then  the  model  is  partially  over identified. 

If  all  free  and  constrained  parameters  of  the  model  are  over identified, 
then  the  model  is  over identified.  Finally,  applying  the  term  "identified" 
to  a  model  implies  that  the  model  is  at  least  just  identified. 

To  illustrate  identification,  let  us  consider  the  identification  of  the 
parameters  of  the  model  in  Figure  4.9  represented  by  the  system  of 
simultaneous  equations  in  Bq.  4.1.  We  assume  that  in  the  phi  matrix  for  the 
model  in  Figure  4.9  the  exogenous  variables  are  mutually  uncorrelated  with 
unit  variances  and  that  the  disturbance  variables  are  also  mutually 
uncorrelated;  that  is,  and  $Xx  are  identity  matrices,  $5*  =  0,  and 
is  a  diagonal  matrix.  We  may  then  derive  the  elanents  of  the 
hypothetical  variance-covariance  matrix  for  the  observed  variables  by  Eqs. 
4.9a  through  4.9c.  Actually,  one  does  not  need  in  practice  to  use  Eqs.  4.9a 
through  4.9c  per  se  to  find  the  variances  and  covariances  among  the 
manifest  variables  according  to  the  model.  Che  can  also  use  processes 
similar  to  those  discussed  in  connection  with  Condition  10  in  Section  2  to 
express  correlations  in  terms  of  path  coefficients.  These  processes  involve 
rules  one  can  apply  to  the  parameters  in  structural  diagrams  to  determine  the 
variances  and  covariances  in  a  rather  straightforward  way  (cf.  Costner, 


157 


1969;  Heise,  1975;  Namboodiri  et  al_.,  1975).  To  illustrate  these 
procedures  for  finding  covariances,  consider  that  part  of  the  model  in  Figure 
4.9  that  deals  with  causes  of  yy..,Yg,  which  we  have  reproduced  in 
Figure  4.10. 


Insert  Figure  4.10  about  here 


This  submodel  implies  that  the  covariance  between  and  Xj  is 
2  2 

equal  to  '  vg*iere  -i  *-s  t^ie  variance  of  *}  . 

The  basis  for  this  implication  is  that  ^3  anc*  X,  have  the  latent 
variable  as  a  common  cause.  Thus,  their  covariance,  indicated  by 
should  be  determined  by  the  extent  to  which  they  share  common 
variance  attributable  to  *1^,  which  is  given  by  multiplying  the  product  of 
the  parameters  relating  and  ^4  to  ^3  times  the  variance  of 

l). .  However,  for  simplification  we  will  require  that  the  variances  of 
all  latent  endogenous  variables  equal  unity.  Hence,  using  the  same  reasoning, 


the  complete  set  of  covariances  among  the  manifest  indicators 

of  ^  is: 

<T  =  0L  CL 

-34  -31-41 

(4.10) 

%  =  *31*51 

(4.11) 

~36  =  $lhl 

(4.12) 

*45  =  *41*51 

(4.13) 

*46  =  *41  %1 

(4.14) 

*56  =  *5l%l 

(4.15) 

We  may  also  obtain  these  same  results  by  applying  EJgs. 

4.9a  through 

4.9c  where  we  can  take  advantage  of  the  fact  that  the  fixed  parameters  set 
equal  to  zero  in  B  *  1^  -  A,  I**,  and  4  allow  us  to  simplify  expressions  for 
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these  same  covariances  as  in  Bqs.  4.10  through  4.15.  In  any  case,  however 

we  obtain  them,  we  have  six  equations  and  four  unknowns,  namely  a^,  0^, 

Ct^,  and  Q^.  This  means  we  may  solve  this  system  of  six  equations  in 

more  than  one  way  to  obtain  estimates  of  0!^,  ^61'  006  9°^ut^on 

for  these  parameters  is  given  by: 

>1/2 

,1/2 


^31  =  +%4?35/-°45)‘ 


*41  =  +  ^?34“45/-35) 


*51  "  +(?35?45/-34) 
*61  =  +(?36-46/234) 


1/2 

1/2 


(4.16) 

(4.17) 

(4.18) 

(4.19) 


But  Eqs.  4.16  through  4.19  do  not  represent  the  only  possible  solution  for 
the  unknown  parameters  a.^,  ...  >ct>^  Parameters  through  are 

overidentified- because  there  is  more  than  one  way  to  solve  for  each  of  the 
four  unknown  parameters  in  terms  of  different  elements  of  Z  .  For  example, 
we  used  Bqs.  4.10,  4.11  and  4.13  to  obtain  the  solution  for  in  Bq . 
4.16.  But  we  could  just  as  easily  have  used  Bqs.  4.11,  4.12  and  4.15, 
respectively,  to  solve  for  <*^1  as 

“31  =+(?35?36/?56)l/2  <4-20) 

Thus  is  an  overidentifed  parameter,  as  are  the  other  unknown  parameters 

of  this  system  of  equations. 


By  similar  procedures  we  can  solve  for  parameters  ou. ?2>  ...,0^  2 
in  connection  with  the  four  indicators  of  ^  and  for  parameters  ^ 

. ..,  Y,  a  1  in  connection  with  the  four  indicators  of  E, .  Because  in 
each  of  these  cases  there  are  four  indicators  of  a  common  latent  variable. 


the  structural  parameters  relating  these  indicators  to  the  common  latent 
variable  are  over identified. 


159 


The  free  parameter  QL^i  relating  *)  to  >12  is  also  an  over¬ 
identified  parameter,  lb  see  this  consider  that  the  covariances  between 
different  pairs  of  manifest  indicators  of  different  latent  variables  may  be 
obtained  as  the  products  of  structural  parameters  in  causal  paths  that 
radiate  out  frcm  a  single  latent  variable  and  terminate  at  each  end  with  one 
of  the  manifest  indicators  (see  Figure  4.11).  Thus,  applying  this  principle 
to  the  causal  paths  involving  the  parameter  ,  we  obtain 

?37  =  ^31^21^2 

?38  =  ^1*21-82 
-39  =  ^1^21^92 
-3,10  =  ^1^21^0,2 


(4.21) 

(4.22) 

(4.23) 

(4.24) 


Any  one  of  these  equations,  as  well  as  others  involving  covariances  for  pairs 
of  variables  from  other  sets  of  indicators  of  ^  and  ^ ,  respectively,  may 
be  used  to  solve  for  because  the  solutions  for  the  other  parameters 
in  each  of  these  equations  are  obtainable  using  other  elements  of  2Q. 
Parameter  &21  *-s  therefore  very  much  over  identified.  By  similar 
reasoning  we  can  show  that  ahd  V  2  are  also  over  identified.  For 

exanple,  <r  ^  -  X12*31.  -  ?u%v  **  »  »"• 

0^31  and  0^^  are  already  determined  by  other  elements  of 


Insert  Figure  4.11  about  Here 


The  structural  parameters  through  14  associated  with  the 
causal  paths  of  disturbance  variables  ^  through  6^  of  Figure  4.9  are 
identified,  for  they  may  be  determined  from  the  respective  diagonal 
elements  of  Eg  once  the  other  parameters  of  the  model  have  been 
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determined  from  the  off-diagonal  elements  of  2^.  That  is,  the  values 
estimated  for  ^3  through  6^  ^  will  reflect  the  residual  variances 
in  variables  that  cannot  be  accounted  for  by  the  functions  indicated  in 
the  model. 

The  structural  parameters  8  ^  and  associated  with  the  paths 
of  the  disturbance  variables  £  and  §2  for  the  latent  endogenous 
variables  and  will  not  be  identified  unless  one  makes  some 
specification  before  analysis  that,  in  effect,  specifies  the  variances  of 
the  latent  endogenous  variables  and  We  have  already  required 

the  variances  of  latent  endogenous  variables  to  equal  unity;  in  other  words, 
we  require  the  following  equalities  to  hold: 


*  2 

=  S  2 

+  y  2  +  V 

'11 

CN 

r—i 

\ 

r—i 
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These  equalities  properly  constrain  solutions  for  5^  and  §22'  So™6 
computing  algorithms  for  structural  equation  analysis  provide  for 
specifying  constraints  among  parameters  such  as  these  (Bentler  arri  Lee,  1982) . 
When  they  do  not,  the  user  must  specify  arbitrary  values  for  the  parameters 
to  fix  the  metric  of  their  respective  latent  endogenous  variables. 

In  sum,  because  all  parameters  not  involving  disturbances  of  the  model 
in  Figure  4.9  are  over identified,  the  model  may  be  said  to  be  over identified. 

It  is  important  to  emphasize  that  the  free  parameters  of  the  model  are 
just  identified  or  over identified  depending  on  the  assumptions 
(predictions)  made  in  specifying  values  for  the  fixed  parameters.  It  may 
not  be  obvious,  but  the  model  in  Figure  4.9  involves  a  large  number  of 


fixed  parameters.  For  example,  the  absence  of  an  arrow  from  £  to  ^ 
means  that  the  parameters  Y^  is  equal  to,  or  fixed,  at  zero.  Hie  same 
is  true  for  the  absence  of  an  arrow  from  x2  to  Y]^.  Furthermore,  every 
disturbance  variable  is  regarded  as  a  cause  of  one  and  only  one  manifest  or 
latent  endogenous  variable.  All  arrows  emitting  from  a  disturbance  to  any 
other  variable  have  parameters  fixed  at  zero  (i.e.  off-diagonal  elements 
of  A  must  be  zero) .  Furthermore  the  disturbance  variables  are  presumed  to 
have  unit  variances  and  to  be  mutually  uncorrelated,  with  their  covariance 
matrix  fixed  to  be  an  identity  matrix.  IWo  other  instances,  which  do 
not  exhaust  all  possibilities,  are  that  (a)  manifest  indicators  of  latent 
variables  are  caused  by  one  and  only  one  latent  variable,  and  (b)  the 
causal  relationship  between  ^  and  is  recursive,  implying  that  a^2 
=  0,  (i.e.  Jj2  is  not  a  reciprocal  cause  of  ^) .  Finally,  the  model 
contains  no  constrained-equal  parameters,  although  this  condition  could  be 
implemented  if  we  so  desired.  For  example,  we  might  theorize  that  Qt^ 
and  in  EJg.  (4.11)  are  of  equal  magnitude.  One  result  of  this 
specification  would  be  that  the  parameter  0 l  ^  is  overidentified  (i.e., 
one  would  have,  in  effect,  two  unknowis  in  three  equations) . 

Impl ications  of  identification  for  testing  goodness  of  fit.  As 
discussed  earlier,  the  degree  to  which  a  structured  equation  model  reflects 
reality  is  assessed  by  the  degree  to  which  £^,  the  hypothetical  variance- 
covariance  matrix  generated  according  to  Bgs.  4.9a  through  4.9c,  is  similar 
to,  or  has  a  good  fit  with,  the  matrix  E,  which  is  the  unrestricted, 
empirical  variance-covariance  matrix  for  the  same  manifest  variables.  In 
practice,  2^  and  £  are  replaced  by  sample  estimates,  namely  Eg  and  Sj 


respectively.  Identification  has  important  implications  for  testing  the  fit 

A 

between  and  S.  Consider,  first,  that  there  is  only  one  possible  solution 
for  the  estimate  of  a  just  identified  structural  parameter.  This  suggests 
that  estimates  for  the  free  and  constrained  parameters  of  a  just  identified 
model  can  always  be  foind  in  such  a  way  that  the  resulting  ZQ  matrix 
exactly  equals  the  matrix  S.  Just  identified  models  always  have  a  perfect 
fit  to  data.  But  such  good  fits  are  tautological  and  trivial,  and  in  no  way 
indicate  the  scientific  usefulness  of  a  model.  Cn  the  other  hand,  because  an 
overidentified  parameter  of  a  model  may  be  estimated  in  several  ways  (using 
different  sets  of  elements  of  the  sample  variance-covariance  matrix  S) , 
different  estimated  values  for  the  parameter  can  be  obtained.  More 
importantly,  the  different  estimates  for  an  overidentified  parameter  may  be 
inconsistent  (Costner,  1969).  In  practice,  the  methods  used  to  estimate  an 
overidentified  free  or  constrained  parameter  usually  take  some  weighted 
average  of  the  different  solutions  for  the  parameter.  As  a  result,  there  is 
some  freedom  for  the  sample  variance-covariance  matrix  S  to  differ  from  the 

a 

estimated  hypothetical  variance-covariance  matrix  Zg.  Thus, 

over identified  models  can  be  falsified  when  the  discrepancy  between  S  and 

A 

Zg  is  statistically  significant.  It  follows  that  an  over identified 
model  is  essential  for  testing  the  theory  on  which  the  identifying 
conditions  are  based. 

Since  structural  equation  models  with  latent  variables  are  a  bit  more 
complex  than  those  involving  only  manifest  variables,  the  task  of  assessing 
identification  is  more  difficult.  Furthermore,  because  the  emphasis  is  on 
fixed  parameters  of  the  various  parameter  matrices  of  the  model  (essential 
for  the  computing  algorithms)  ,  the  identification  problem  focuses  on 
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parameters  and  not  equations  as  does  the  traditional  literature  on 
structural  equation  modeling  with  manifest  variables.  Che  can  consult 
Fisher  (1966),  Schmidt  (1976),  Wiley  (1973),  Werts,  Jflreskog,  and  Linn 
(1973)  ,  or  Kbopnans  and  Hood  (1953)  for  general  principles  of 
identification.  Empirical  examples  are  given  by  Jdreskog  (1979a,  1979b)  . 

Estimation. 

Basic  pr inciples.  In  the  case  of  structural  models  with  latent 
variables,  we  make  the  assumption  that  the  distribution  of  the  manifest 
variables  is  a  function  of  their  variance-covariance  matrix.  Making  this 
assumption  allows  us  to  seek  estimates  of  the  free  and  constrained 
parameters  of  the  model  in  the  following  way:  Use  as  estimates  those  values 
for  the  free  and  constrained  parameters  of  the  matrices  B,r*,  and  $  that 
together  with  the  values  of  the  fixed  parameters  make  the  estimated  model 
variance-covariance  matrix  ZQ  for  the  manifest  variables  as  much 
like  the  sample  variance-covariance  matrix  S  as  possible.  We  will  now  show 
how  this  principle  is  implanented  in  the  major  methods  used  to  estimate 
parameters  of  structural  equation  models  with  latent  and  manifest  variables. 

A  measure  of  the  degree  of  similarity  or  fit  of  the  matrix  FQ 
to  the  matrix  S  is  given  by  a  function  of  the  elements  of  these  matrices 
known  as  a  loss  function.  The  term  "loss  function"  is  derived  from  the 
idea  that  when  estimating  a  parameter  one  takes  a  "loss"  when  the  value 
chosen  as  an  estimate  of  the  parameter  differs  frcm  the  actual  value  of  the 
parameter.  The  loss  function  thus  provides  a  measure  of  the  degree  of  this 
loss  by  measuring  some  function  of  the  discrepancy  between  estimate  and 
parameter.  A  desirable  property  of  a  loss  function  is  that  it  should  take 
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on  a  large  value  when  the  discrepancy  between  the  estimate  and  the  parameter 
estimated  is  large  and  equal  zero  when  there  is  no  discrepancy.  In 
addition,  when  a  nunber  of  parameters  are  to  be  estimated,  the  loss  function 
should  attempt  to  combine  the  individual  losses  into  an  overall  measure  of 
loss.  In  the  case  of  structural  models  with  latent  variables,  we  will  seek 
estimates  for  the  free  parameters  of  the  structural  model  so  that  the 

a 

estimated  variance-covariance  matrix  derived  from  these  free 
parameters  and  the  fixed  parameters  will  minimize  a  loss  function  defined  on 
2.  and  S. 

Important  methods.  A  number  of  loss  functions  are  available  for  this 
purpose,  lb  begin  with,  one  can  use  least  squares  (LS)  and  minimize  the 
function 

LS  =  trltS  -  iQ) '  (S  -  20)] 

This  method  of  estimation  minimizes  the  sum  of  squared  differences  between  the 

A  A 

elements  of  S  and  The  matrix  (S  -  SQ)  represents  the  element-by^ 

A  A 

element  difference  between  S  and  2g.  £Q  is  derived  using  Eqs.  4.9a 
through  4.9c  with  estimated  values  for  the  free  and  constrained  elements  and 
£  priori  values  for  the  fixed  elements  of  the  matrices  Ehr*,  and  <$.  The  sum 
of  squared  differences  between  the  elements  of  the  corresponding  rows  of 
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(S_  -  2q)  is  given  by  the  diagonal  elements  of  (S  -  2Q) '  (S  -  2^)  ,  a  square  matrix 
The  sum  of  the  diagonal  elements  of  a  square  matrix  is  knowi  as  the  trace 
of  the  matrix  and  is  denoted  by  tr[  ].  Thus  the  sum  of  the  diagonal 

A  A 

elements  of  (S  -  Zq)  '  (S  -  2Q) ,  or  the  trace  of  this  matrix, 


gives  the  sum  of  squared  differences  between  corresponding  elements  of  S  and 
.  In  other  words,  the  criterion  would  force  us  to  seek  values  of 
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the  free  and  constrained  elements  that  would  make  the  sun  of  squared 

A 

differences  between  the  elements  of  the  matrices  S  and  as  small 
as  possible.  Note,  however,  the  term  "least  squares  estimation"  in  this 
case  does  not  refer  to  the  ordinary  least  squares  estimation  process  used 
with  models  of  manifest  variables. 

Maximum  likelihood  estimation  begins  with  a  sample  from  a  population 
whose  distributional  type  is  known  (e.g.  the  multivariate  normal 
distribution)  and  seeks  those  values  for  the  unknown  parameters  of  the 
distribution  that  would  make  the  likelihood  of  obtaining  the  sample  a 
maximum.  If  we  used  the  loss  function  of  maximum  likelihood  estimation,  we 
would  seek  to  minimize  the  function 

ML  =  log  I  £q  |  +  tr(S^-1)  -  loglSl  -  (j^  +  n2) 

A  A 

which  approaches  zero  as  S  and  become  the  same,  log|EQ| 

A 

denotes  the  logorithm  of  the  determinant  of  the  square  matrix 

The  determinant  is  a  number  obtained  from  the  elements  of  the  matrix  in  a 

way  too  complex  to  describe  here.  Note  that  if  S  and  are  equal,  then 

A 

loglZgl  and  log|S|  wall  be  the  same  and  their  difference  will  be 
equal  to  zero.  Similarly  if  S_ and  are  the  same,  1  will  be  the 

*  _i 

matrix  inverse  of  S.  Hence,  (SZg  )  =  I,  an  identity  matrix, 
whose  diagonal  elements  are  all  unities,  the  sun  of  which,  given  by 
the  trace  function,  equals  (m^  +  r^) ,  the  number  of  manifest  variables. 

Thus,  the  difference  between  tr(S_  £0~*)  and  (rr^  +  n^)  will 

equal  zero  in  this  case.  When  ML  equals  zero,  it  is  at  an  absolute  minimum. 

A 

This  absolute  minimum  is  never  attained  when  S  and  are  different, 

but  it  may  be  possible  to  find  estimates  for  the  free  and  constrained 
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elements  of  B,  r*,  and  £  that  make  2^  as  much  like  S  as  possible 
according  to  this  criterion. 

Or  one  can  use  the  loss  function  of  generalized  least  squares,  in  which 
case  one  would  minimize 

GLS  =  tr(I  -  S-1  2.) 2 

As  explained  previously  S  2^  will  be  an  identity  matrix  (I)  when 

a 

S  and  Ln  are  identical.  Hence  the  GLS  criterion  also  approaches  zero 
as  S  and  became  the  same. 

An  advantage  of  the  least  squares  (LS)  method  of  estimation  is  that  it 
may  be  implemented  without  making  any  distributional  assumptions  about  the 
observed  variables.  On  the  other  hand,  the  least  squares  method  of 
estimation  does  not  lend  itself  to  testing  the  statistical  significance  of 
differences  indicated  by  the  trace  function.  In  contrast,  statistical  tests 
are  possible  with  the  maximum  likelihood  and  the  generalized  least  squares 
methods,  lb  use  statistical  tests  with  the  maximum  likelihood  method,  the 
distribution  of  the  manifest  variables  should  follow  the  multivariate  normal 
distribution.  If  this  assumption  is  questionable,  then  the  generalized 
least  squares  (GLS)  procedure  may  be  used.  Generalized  least  squares  also 
makes  no  distributional  assumptions,  but  still  allows  for  statistical  tests. 
Finally,  it  is  extremely  important  to  note  that  statistical  inference  based 
on  maximum  likelihood  and  generalized  least  squares  estimates  is  possible 
only  with  large  samples.  One  reason  for  requiring  large  samples  is  that  the 
chi  square  statistics  used  to  test  the  significance  of  the  maximum 
likelihood  and  generalized  least  squares  loss  functions  are  only 
asymptotically  distributed  as  chi  square.  That  is  to  say,  the  sampling 
distributions  of  these  chi  square  statistics  only  approximate  chi  square 
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distributions  in  large  samples. 

The  algorithmic  procedures  for  obtaining  estimates  for  the  free  and 
constrained  elements  of  the  linear  structural  equation  model  with  latent 
variables  are  beyond  the  scope  of  this  treatment  (cf.  Jflreskog,  1973; 
Bentler  &  Weeks,  1980).  Che  should  know,  however,  that  the  procedures  are 
iterative  and  quite  time-consuming ,  even  with  large,  high-speed  computers, 
and  may  require  considerable  core  memory  as  well.  These  procedures 
estimate  all  of  the  unknown  (free  and  constrained)  elements  of  the  matrices 
B,  T*  and  $  at  the  same  time.  This  means  that  lack  of  fit  between  data  and 
a  model  can  be  spread  across  all  parameter  estimates  to  force  an  optimal 
fit.  The  result  is  a  tendency  to  obscure  which  fixed  elements  are  mainly 
responsible  for  the  lack  of  fit  of  the  model  to  data.  This  is  a  common 
problem  for  all  "full- information"  estimation  techniques;  that  is, 
techniques  that  estimate  all  free  and  constrained  parameters  in  a  model 
simultaneously  (cf.  Johnston,  1972).  On  the  other  hand,  if  the  conditions 
for  confirmatory  analysis  have  been  reasonably  satisfied,  then  the  estimates 
furnished  by  full- information  techniques  are  both  consistent  and  efficient. 
Unfortunately,  applications  of  latent  variable  structural  modeling  in 
psychology,  like  those  for  manifest  variables,  tend  to  be  extremely  weak  in 
regard  to  attention  to  the  conditions  essential  for  confirmatory  analysis, 
especially  in  connection  with  the  unmeasured  variables  problem  (Cliff,  Note 
1) .  we  will  discuss  these  points  later  in  this  section. 

Computer  programs.  A  number  of  computer  programs  are  available  for 
computing  the  estimates  of  the  parameters  of  structural  equation  models. 

The  program  LISREL  IV  prepared  by  Jflreskog  and  SSrbom  (1978)  is  perhaps 
the  most  widely  available  program  at  this  time  for  structural  models  with 
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latent  variables.  However,  its  use  of  the  notational  distinction  between 
the  measurement  model  and  the  structural  equation  model  is  unnecessary  and 
leads  to  difficulties  in  treating  the  case  where  a  manifest  variable  is 
regarded  as  simultaneously  an  effect  of  both  latent  endogenous  and  latent 
exogenous  variables.  Furthermore,  in  estimating  variances  of  disturbance 
variables,  LISREL  IV  frequently  produces  "Heywood"  cases  where  the  variance 
estimates  are  negative.  However,  a  program  that  implements  the  structural 
equation  model  used  in  this  book  would  not  have  Heywood  cases  when 
estimating  the  variances  of  the  disturbance  variables,  for  these  could  be 
fixed  arbitrarily  to  unity.  The  relative  contribution  of  a  disturbance 
variable  6^  would  then  be  indicated  by  the  square  of  the  structural 
parameter  which  is  unaffected  by  the  sign  of  C06AN,  a  progran 

informally  distributed  by  R.  P.  McDonald  based  on  a  general  model  for 
covariance  structure  modeling  (McDonald,  1978),  can  be  easily  used  with  the 
notation  used  in  this  book.  COSAN  has  the  additional  feature  of  permitting 
both  maximum- likelihood  and  generalized  least  squares  solutions  for 
parameter  estimates.  Bentler  is  also  developing  a  program  to  implement  the 
model  of  Bentler  and  Weeks  (1980),  which  should  be  readily  adaptable  to  the 
notation  of  this  book.  At  this  writing  Jflreskog  and  Sflrban  (1981)  have 
just  released  version  V  of  the  LISREL  program.  Jflreskog  states  (personal 
communication)  that  LISREL  V  will  be  distributed  to  replace  all  versions  of 
LISREL  IV  now  in  use.  Although  LISREL  V  still  does  not  provide  for 
general ized  least  squares  estimation  and  adheres  to  the  model  equation  of 
LISREL  IV  with  attendant  problems  with  Heywood  cases  in  estimation  of 
variances  of  disturbance  terms,  the  main  improvsnent  has  been  a  10- fold 
reduction  in  computing  time  required  to  obtain  maximum  likelihood  estimates 
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of  the  free  parameters  of  a  structural  equation  model.  This  has  been 
brought  about  through  the  use  of  an  improved  approach  to  determining 
initial  values  for  estimates  of  the  parameters  from  vhich  to  begin  the 
iterative  process.  The  improvement  in  computing  speed  should  make  LISREL  V 
very  popular  in  spite  of  the  minor  difficulties  we  have  mentioned. 

We  continue  now  with  the  question  of  significance  tests  for  the 

A 

goodness  of  fit  between  2  and  Z.  using  S  and  Zn. 

Significance  Tasting  and  Goodness  of  Fit  Tests. 

Testing  the  support  for  a  model.  Previously  in  connection  with 
Conditions  9  and  10  in  Section  2  we  discussed  how  to  evaluate  the  empirical 
support  for  a  confirmatory  model  based  on  manifest  variables.  Condition  9, 
we  said,  concerns  testing  whether  certain  causal  parameters  are  different 
from  zero.  It  was  pointed  out  that  causal  relations  are  indicated  by 
nonzero  structural  parameters  in  structural  equations.  Thus,  empirical 
support  for  causal  relationships  is  given  by  statistical  tests  that  reveal 
that  the  corresponding  estimates  of  free  structural  parameters  differ 
significantly  from  zero.  Condition  10,  on  the  other  hand,  concerned  the 
fit  of  a  model  to  data.  This,  we  pointed  out,  can  be  assessed  by  taking 
advantage  of  the  fact  that  the  estimated  and  fixed  structural  parameters  of 
a  structural  equation  model  determine  an  estimate  of  a  hypothetical 
variance-covariance  (correlation)  matrix  for  the  manifest  variables  under 
the  assumption  that  the  model  is  valid.  If  the  estimate  of  the 
hypothetical  variance-covariance  matrix  derived  from  the  fixed  and 
estimated  free  parameters  of  the  structural  equations  does  not  differ 
significantly  from  the  observed  (unrestricted)  estimate  of  the 
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variance-covariance  matrix  for  the  manifest  variables,  then  we  say  that  the 
model  fits  or  is  consistent  with  the  data. 

In  the  case  of  structural  equation  models  with  latent  variables,  we 
can  perform  tests  to  determine  whether  both  Conditions  9  and  10  are 
satisfied.  However,  how  one  performs  these  tests  will  be  constrained  by 
the  kinds  of  statistical  tests  that  are  possible  with  latent  variable 
models.  As  described  briefly  above,  all  of  the  available  algorithms  for 
estimating  the  free  parameters  of  a  structural  equation  model  use 
full- information  methods  of  estimation.  This  means  all  free  and 
constrained  parameters  of  the  model  are  estimated  simultaneously.  These 
full- information  algorithms  usually  provide  three  kinds  of  statistical 
tests  to  test  hypotheses  about  the  parameters  of  a  model:  (a)  confidence 
interval  tests  applied  to  estimates  of  individual  free  parameters  of  the 
model,  (b)  tests  comparing  nested  models,  and  (c)  tests  of  overall  goodness 
of  fit  of  ZQ  to  S.  The  first  two  ways  of  testing  statistical 
hypotheses  lend  themselves  to  methods  for  assessing  whether  Condition  9  is 
satisfied,  that  is,  whether  certain  free  structural  parameters  are  equal  to 
zero  or  not.  The  third  way  provides  a  basis  for  assessing  whether 
Condition  10  concerning  the  goodness  of  fit  of  the  model  to  data  is 
satisfied.  However,  ways  (b)  and  (c)  are  special  cases  of  the  same  chi 
square  goodness-of-fit  test  and  will  be  discussed  together,  although  they 
concern  different  substantive  questions. 

Tests  of  individual  parameters.  The  latent  variable  literature 
contains  little  information  about  the  use  of  confidence  intervals  to  test 
hypotheses  about  individual  free  parameters.  Nevertheless,  confidence 
interval  tests  are  possible  if  parameter  estimates  are  obtained  by  either 
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full- information  maximum-likelihood  or  generalized  least-squares 
estimation.  The  tests  are  based  on  estimates  of  the  standard  errors  of  the 
free  parameter  estimates  derived  from  the  "information  matrix",  which  is 
generated  as  a  byproduct  of  the  estimation  algorithm  (cf.  Bentler  &  Weeks, 
1980;  Jdreskog,  1973).  The  standard  errors  for  the  free  parameter 
estimates,  however,  are  conditional  standard  errors;  they  are  conditional 
on  the  fixed  and  constrained  parameters  of  the  model.  Thus,  confidence 
interval  tests  of  whether  free  parameters  associated  with  expected  causal 
pathways  differ  significantly  from  zero  will  be  meaningful  and  unbiased 
only  if  the  specifications  regarding  fixed  and  constrained  parameters  have 
been  determined  to  have  an  acceptable  goodness  of  fit  to  the  data.  In  any 
case,  one  has  estimates  of  standard  errors  and  consequently  confidence 
interval  tests  for  just  those  free  parameters  of  interest  when  assessing 
satisfaction  of  Condition  9. 

The  application  of  these  confidence  interval  significance  tests  should 
be  straightforward.  For  each  free  parameter  of  the  model  one  can  construct 
a  confidence  interval  around  the  estimate  of  the  parameter  using  the 
estimate  of  the  standard  error  of  the  parameter.  If  the  confidence 
interval  does  not  include  zero,  then  one  can  conclude  that  the  parameter 
differs  significantly  from  zero,  thereby  indicating  a  causal  connection 
between  the  variables  in  question. 

However,  when  the  researcher  seeks  to  test  hypotheses  about  a  number 
of  different  paraneters  of  the  model  with  a  series  of  tests  (i.e.  one  test 
per  free  parameter) ,  he  or  she  may  want  to  control  for  the  probability  of 
making  at  least  one  Type  I  error  of  incorrectly  rejecting  a  null  hypothesis 
when  it  is  true,  over  the  series  of  tests.  Tb  do  this  the  researcher  will 
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have  to  have  a  wb y  to  compute  this  probability.  This  can  be  done  exactly 
if  one  knows  the  tests  are  mutually  independent  and  the  level  of 
significance  of  each  test.  But  in  most  situations  one  will  have  a  series 
of  tests  of  parameters  estimated  from  a  single  sample,  and  the  tests  will 
not  be  mutually  independent.  It  follows  that  the  conditional  probability 
of  making  a  Type  I  error  on  a  test  given  one  has  already  made  a  Type  I 
error  on  a  previous  test  will  not  be  given  by  the  nominal  level  of 
significance  used  for  the  test.  A  number  of  general  procedures  are 
available  for  dealing  with  this  situation,  many  of  which  revolve  around  the 
Bonferroni  method  of  defining  more  stringent  levels  of  significance  for 
individual  tests  of  significance.  These  procedures,  which  are  reviewed  in 
Larzelere  and  Mulaik  (1977)  and  Stavig  (1981),  have  the  major  drawback  of 
being  less  powerful  than  procedures  that  involve  a  series  of  tests  known  to 
be  statistically  independent.  Nevertheless,  power  may  not  be  a  major 
concern  with  the  large  samples  required  for  goodness-of-fit  testing  with 
the  latent  variable  methods  .  The  alternative  to  using  confidence 
intervals  for  testing  hypotheses  about  individual  parameters  is  based  on 
the  overall  goodness-of-fit  tests.  We  will  first  describe  the 
goodness-of-fit  test  and  then  return  to  the  alternative  procedure  for 
testing  individual  parameters.  (Before  departing  from  this  point,  we 
should  mention  that  the  logic  described  above  for  controlling  for  Type  I 
errors  in  latent  variable  models  applies  also  to  manifest  variable  models) . 

Goodness-of-Fit  Tests.  After  one  has  obtained  maximum  likelihood 
estimates  of  the  free  and  constrained  parameters,  he  or  she  can  then 
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generate  the  estimated  model  variance-covariance  matrix  and 
compare  this  for  goodness  of  fit  with  the  sample  variance-covariance 
matrix  S  estimated  in  the  usual  way  under  no  restrictions.  Tb  do  this 
one  can  compute  the  statistic 

F  =  N  [log | ZQ |  -  loglSl  +  trfS^-1)  -  (n^  +  n2)]/2 
which,  when  the  sample  size  N  is  very  large,  is  approximately  distributed 
as  chi  square  with  (n^  +  n^)  (n^  +  ^  +  l)/2  -  _s  degrees  of  freedom, 
s  is  the  number  of  independent  free  and  distinct  constrained  parameters 
of  the  model.  If  the  sample  value  of  this  statistic  exceeds  the  critical 
value  of  chi  square  with  the  respective  degrees  of  freedom  at  the  specified 
level  of  significance,  the  model  is  rejected  for  lack  of  fit  to  the  data. 
However,  one  must  keep  in  mind  that  this  test  actually  concerns  a  test  of 
the  appropriateness  of  the  values  chosen  for  the  parameters  fixed  to 
specify  the  model.  That  is  to  say,  the  statistical  hypothesis  tested  by 
the  overall  goodness-of-fit  test  is  that  the  fixed  parameters'  values  are 
what  they  have  been  fixed  to.  The  test  does  not  test  hypotheses  about  the 
free  parameters  of  the  model;  these  have  been  estimated  in  such  a  way  as  to 
make  the  fit  of  the  model  to  the  data  as  close  as  possible,  conditional  on 
the  values  of  the  fixed  parameters.  And  so,  any  lack  of  fit  will  be  a 
result  of  the  fixed  parameter  values  and  not  the  free  parameters.  Thus,  if 
the  test  yields  a  significant  goodness-of-fit  statistic,  one  then  must 
consider  that  Condition  10  is  not  satisfied,  that  one's  specification  of 
the  model  in  terms  of  fixed  and  constrained  parameters  is  possibly  not 
correct.  Ch  the  other  hand,  if  the  overall  goodness-of-fit  test  statistic 
is  not  significant,  it  means  that  Condition  10  is  satisfied,  that  is,  the 
specification  of  the  model  is  consistent  with  the  data. 
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Tests  of  ind ividual  parameters  using  the  F  statistic.  One  can  use 
the  above  chi  square  goodness-of-fit  statistic  F  to  test  hypotheses  about 
individual  (and  even  groups  of)  parameters  in  a  model.  Thus,  this 
statistic  can  also  be  used  to  assess  whether  Condition  9  concerning 
hypotheses  about  the  free  parameters  of  the  structural  model  has  been 
satisfied,  lb  apply  the  chi  square  goodness-of-fit  statistic  F  for  this 
purpose,  one  must  make  a  comparison  between  two  models:  one  with  the 
parameter  in  question  fixed  to  the  hypothesized  value,  the  other  a  model 
that  is  identical  to  the  first  model  except  that  the  parameter  (or 
parameters)  in  question  is  (are)  free  to  be  estimated.  The  model  with  the 
value  of  the  parameter(s)  fixed  by  hypothesis  will  ordinarily  yield  a 
larger  chi  square  statistic  than  the  one  with  the  same  parameter(s) 
allowed  to  be  free.  The  difference  between  these  two  chi  squares  is  also  a 
chi  square  statistic,  known  as  the  difference  chi  square,  with  degrees  of 
freedom  equal  to  the  difference  in  degrees  of  freedom  between  the  original 
chi  squares.  If  the  difference  chi  square  is  significant,  one  rejects  the 
null  hypothesis  about  the  fixed  parameter (s) .  This  approach  is  illustrated 
below. 

A  priori  sequences  of  tests.  Bentler  and  Bonnet  (1980)  have  recently 
reviewed  the  problem  of  conducting  a  series  of  independent  statistical 
tests  about  individual  parameters  of  a  structural  equation  model  with 
latent  variables.  Ttie  purpose  of  such  a  series  of  tests  is  not  to  explore 
data,  but  to  test  a  number  of  a  priori  hypotheses  in  connection  with  the 
parameters  of  a  particular  model  included  within  a  nested  sequence  of 
models.  According  to  Boy  (1978)  one  can  construct  a  series  of  independent 
statistical  tests  of  individual  parameters  by  specifying  an  a  pr iori 
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sequence  of  nested  models  whose  goodness  of  fit  to  the  data  is  to  be 
tested,  beginning  first  with  a  test  of  the  goodness  of  fit  of  an  initial 
over identified  model.  Each  subsequent  model  to  test  must  differ  from  the 
previous  model  only  in  having  additional  fixed  parameters  over  and  beyond 
those  of  the  previous  model.  Each  model,  therefore,  is  a  specialization  of 
the  previous  model  in  having  the  previous  models'  same  fixed  parameters  as 
well  as  additional  ones.  Cnee  the  a  priori  sequence  of  models  has  been 
specified,  then  a  sequence  of  statistical  tests  is  performed  as  follows: 

For  such  a  sequence  of  models,  let  E\  and  F.+1  be  the  chi 
square  values  with  degrees  of  freedom  d^  and  d^,  respectively, 
for  the  goodness-of-fit  tests  of  models  i_  and  _i+l,  respectively.  Then 
the  test  of  the  effect  of  not  fixing  the  additional  parameters  in  the  £th 
model,  as  is  done  in  the  (_i  +  l)st  model,  is  given  by  the  difference  chi 
square  value  _Fj  =  E\+^  -  F .  ,  which  has  d  =  (d^+1  -  d^)  degrees  of 
freedom.  This  sequence  of  tests  continues  until  one  rejects  the  null 
hypothesis,  lb  go  on  beyond  this  point  with  tests  of  subsequent  models 
would  be  logically  inconsistent,  because  subsequent  models  would  contain 
parameter  values  that  had  been  rejected  in  some  previous  test.  With  such  a 
series  of  tests  one  could  test  the  specification  of  a  model  (condition  10) 
and  then  examine  the  causal  connections  between  variables  (condition  9) . 
(See  also  the  discussion  in  Mulaik  (1972)  ,  pp.  411-417  or  Larzelere  and 
Mulaik  (1977)  for  the  description  of  analogous  tests  in  multivariate 
step-down  regression)  . 

It  must  be  pointed  out  that  to  achieve  a  series  of  independent 
statistical  tests  with  an  £  pr iori ,  nested  sequence  of  models,  one 
pays  the  price  of  being  constrained  to  testing  hypotheses  about  parameters 
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in  the  order  in  which  they  are  fixed  in  specifying  the  nested  sequence  of 
models.  Furthermore,  because  the  sequence  of  statistical  tests  ends  when 
a  null  hypothesis  (about  the  fixed  values  of  one  or  more  parameters)  has 
been  rejected,  it  is  essential  to  arrange  the  testing  of  hypotheses  about 
parameters  in  the  order  of  their  relative  importance. 

Thus,  in  defining  a  sequence  of  models  so  one  can  test  hypotheses 
about  different  parameters  of  a  structural  equation  model,  one  will  first 
test  hypotheses  about  those  parameters  that,  when  fixed,  specify  a 
measurement  model  relating  the  manifest  variables  of  the  model  bo  different 
(usually  latent)  hypothetical  construct  variables.  Next,  if  the  previous 
hypotheses  are  accepted,  one  will  test  hypotheses  about  those  parameters 
that,  when  fixed — in  addition  to  the  previously  fixed  values — specify  an 
over identified  causal  structure  among  the  hypothetical  construct  variables 
of  the  model.  Then,  if  the  values  of  previously  fixed  parameters  are 
consistent  with  the  data,  one  will  go  on  to  test  vrtiether  further  fixing  to 
zero  the  values  of  the  structural  parameters  linking  hypothetical  construct 
variables  to  one  another  is  consistent  with  the  data.  If  so,  one  will 
reject  the  causal  structure  among  the  hypothetical  construct  variables  and 
then  test  whether  additionally  fixing  to  zero  the  structural  parameters 
relating  the  manifest  indicator  variables  to  the  hypothetical  construct 
variables  is  consistent  with  the  data.  This  last  test  is  a  test  of  whether 
any  structure  exists  at  all  among  the  manifest  indicator  variables.  We 
will  now  illustrate  a  nested  sequence  of  models  in  which  such  a  series  of 
tests  may  be  performed. 


A  nested  sequence  of  models.  To  illustrate  how,  in  connection  with 
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the  model  in  Figure  4.9,  one  may  construct  a  priori  a  nested  sequence  of 
models,  each  subsequent  model  more  restricted  than  the  previous,  consider  the 
structural  diagrams  in  Figures  4.12  -  4.16.  Figure  4.12  is  a  diagram  of  a 
just  identified  model  for  the  13  manifest  variables  of  the  model  in  Figure 
4.9.  According  to  this  model,  12  of  the  13  manifest  variables  are  dependent 
on  nine  exogenous  latent  common  factor  variables  plus  12  disturbance 
variables.  Three  of  the  latent  variables,  6^,  V^r  and  >]2,  correspond 
to  the  same- labelled  latent  variables  of  the  model  in  Figure  4.9.  The 
remaining  six  latent  variables,  5,  through  (represented  in  Figure  4.12 
by  slightly  smaller  circles  than  those  associated  with  5-^,  T)^,  and  1)2) 
are  included  only  to  illustrate  a  just  identified  model  that  contains  as  a 
specialization  the  more  restricted,  over identified  model  in  Figure  4.9.  The 
thirteenth  manifest  variable  (x2)  in  Figure  4.12  is  a  perfectly  reliable 
exogenous  variable.  Under  the  assumption  that  all  latent  variables  are 
specified  to  have  unit  variances,  the  number  of  free  parameters  to  estimate 
in  this  model  is  91:  (a)  12  structural  coefficients  relating  disturbances  to 

manifest  variables,  (b)  45  independent  covariances  among  the  9  latent 
exogenous  variables  and  the  single  manifest  exogenous  variable  x^,  (c)  the 
variance  of  x2,  and  (d)  33  structural  coefficients  relating  the  9  latent 
exogenous  variables  to  the  12  manifest  endogenous  variables.  Since  the 
numbe:  of  free  parameters  to  be  estimated  (each  corresponding  to  one  of  the 
single  or  two-headed  arrows  in  the  diagram)  equals  the  number  of  variances 
and  covariances  in  the  population  covariance  matrix  for  the  13  manifest 
variables,  the  model  is  just  identified.  The  model's  covariance  matrix  £q 
will  fit  the  population  covariance  matrix  Z  perfectly. 
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Insert  Figure  4.12  about  Here 


Figure  4.13  is  a  more  restricted,  over- identified  model  representing 

the  model  to  use  to  test  hypotheses  about  the  specified  parameters  of  the 

measurement  model  portion  of  the  model  in  Figure  4.9.  In  this  model  a  causal 

structure  is  specified  relating  the  manifest  variables  ^  through 

to  the  latent  variables  ?  ,  2)^  and  Q  of  the  model  in  Figure  4.9.  But, 

in  contrast  to  the  latent  structural  model  in  Figure  4.9,  no  causal  structure 

is  specified  in  Figure  4.13  among  the  variables  E^,  3.  and 

The  model  simply  allows  these  variables  to  be  correlated,  but  in  an 

unspecified  way  and  indicates  this  by  the  double-headed  arrows  between  these 

variables  in  Figure  4.13.  The  reason  the  model  of  Figure  4.13  is  a 

restriction  of  the  just  identified  model  in  Figure  4.12  is  because  six  of  the 

nine  latent  exogenous  (common  factor)  variables  of  the  just  identified  model 

have  been  effectively  deleted  from  the  model.  This  has  been  accomplished  by 

specifying  that  the  structural  coefficients  linking  the  six  latent  exogenous 

variables  with  the  12  manifest  variables  in  Figure  4.12  are  zero  and  that  the 

) 

covariances  of  the  six  latent  variables  with  the  other  manifest  and  latent 
\ 

exogenous  variables  are  zero  also.  In  other  words,  numerous  parameters  of 
the  model  in  Figure  4.12  have  been  restricted  or  fixed  to  equal  zero. 


Insert  Figure  4.13  about  Here 


A  test  of  the  specification  of  the  fixed  parameters  of  the  model  in 


Figure  4.13  is  then  given  by  a  comparison  of  the  goodness  of  fit  statistic 
obtained  for  the  model  in  Figure  4.13  to  the  goodness  of  fit  statistic 
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obtained  for  the  just  identified  model  in  Figure  4.12  (which  is  equal  to 
zero) . 

Figure  4.14  is  a  structural  equation  model  developed  from  the 

measurement  model  in  Figure  4.13  by  specifying  a  causal  structure  among  the 

variables  ^  and  ^2*  In  ^acfc'  ^  *s  t^ie  m°del  described 

previously  in  connection  with  Figure  4.9.  In  this  case  latent  exogenous 

variable  and  manifest  exogenous  variable  x^  are  seen  to  be  direct 

causes  of  only  while  ^  is  a  direct  cause  of  *]2.  Exogenous 

variables  ^  and  x^  are  also  presumed  to  be  unrelated.  The  latent 

structural  model  in  Figure  4.14  is  more  restricted  than  the  model  in 

Figure  4.13.  If  in  Figure  4.14  we  allowed  and  x^  to  covary, 

and  included1  causal  arrows  from  to  ]02,  and  from  x2  to  ^2'  t*ien 

the  resulting  model  would  be  no  more  restricted  than  the  measurement  model  in 

Figure  4.13  and  would  fit  the  data  as  well  as  the  model  in  Figure  4.13.  In 

other  words,  we  would  then  have  just  as  many  free  parameters  to  estimate  in 

the  model  of  Figure  4.14  as  we  do  in  the  model  of  Figure  4.13.  By  fixing 

certain  parameters  equal  to  zero  in  the  model  of  Figure  4.14  (a  =  0, 

2 

Y^2  ~  0#  J?22  =  0),  we  have  overidentif ied  the  structural  parameters 
associated  with  causal  relationships  among  the  variables  >c2,  ^  and 
>)2.  Thus  the  difference  in  the  goodness  of  fit  between  the  model  in  Figure 
4.14  and  the  model  in  Figure  4.13  is  the  basis  for  a  test  of  the  parameters 
fixed  to  specify  an  over  identified  causal  structure  among  the  exogenous  and 
latent  endogenous  variables  of  the  model  in  Figure  4.9. 


Insert  Figure  4.14  about  Here 
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The  structural  diagram  in  Figure  4.15  is  of  an  even  more  restricted 
model  in  v^iich  the  causal  paths  between  the  latent  variables  of  the 
previous  model  in  Figure  4.14  have  been  deleted.  In  other  words,  the 
structural  coefficients  associated  with  causal  connections  among  the  latent 
variables  and  the  manifest  exogenous  variable  in  Figure  4.14  have  all 
been  fixed  to  zero.  A  comparison  of  the  goodness  of  fit  of  this  model  with  the 
goodness  of  fit  of  the  model  in  Figure  4.14  would  be  the  basis  for  determining 
whether  the  causal  structure  between  the  exogenous  and  latent  endogenous 
variables  of  the  model  in  Figure  4.14  exists  or  not. 


Insert  Figure  4.15  about  Here 


Finally  the  diagram  in  Figure  4.16  is  of  the  most  restricted  model  of 
all  in  this  sequence.  No  structure  is  hypothesized  to  exist  among  the 
manifest  variables.  This  model  we  will  call  the  "null  model",  because  it 
hypothesizes  that  the  variables  are  all  mutually  uncorrelated. 


Insert  Figure  4.16  about  Here 


Now,  a  sequence  of  statistical  tests  may  be  performed  as  follows: 

First  compare  the  just  identified  model  of  Figure  4.12  with  the  more 
restricted  measurement  model  in  Figure  4.13.  The  test  of  the  difference 
between  these  two  models  is  given  by  the  chi  square  test  of  the  goodness  of 
fit  of  the  model  in  Figure  4.13  to  the  data.  If  one  accepts  the  null 
hypothesis  (that  the  model  in  Figure  4.13  fits  the  data) ,  one  can  then  go 
on  to  perform  subsequent  tests;  otherwise,  one  must  stop,  since  any 
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subsequent  model  would  have  fixed  parameters  whose  values  were  already 
rejected  in  testing  the  first  over  identified  model. 

Given  acceptance  of  the  model  in  Figure  4.13,  one  can  then  compare  the 

model  in  Figure  4.14  with  the  model  in  Figure  4.13.  The  model  in  Figure  4.14, 

being  the  more  restricted,  becomes  the  null  hypothesis.  The  chi  square 

statistic  to  use  is  the  difference  between  the  chi  square  for  the 

goodness  of  fit  of  the  model  in  Figure  4.14  and  the  chi  square  for  the 

goodness  of  fit  of  the  model  in  Figure  4.13.  As  noted  earlier,  the  degrees 

of  freedom  for  the  chi  square  statistic  comparing  these  two  models  is  the 

difference  in  the  degrees  of  freedom  between  the  two  goodness-of-fit  chi 

squajre  statistics.  If  one  rejects  the  null  hypothesis,  that  is,  rejects 

the  values  of  the  parameters  fixed  to  specify  an  over identified  structure 

among  the  latent  variables,  one  must  stop  further  statistical  testing  and 

accept  the  model  in  Figure  4.13.  Acceptance  of  the  model  in  4.14  suggests 

confirmation  of  the  hypothesis  regarding  fixed  values  in  the  structural  part 

of  the  model.  That  is,  jv.  and  x  are  unrelated  (a  =  0)  ,  and, 

1  1  -1-2 
further,  neither  are  direct  causes  of  (V^  =  0,  =  0). 

On  the  other  hand,  if  one  accepts  the  null  hypothesis  (meaning  the 
model  in  Figure  4.14  has  acceptable  fit) ,  one  can  then  go  on  and  test  the 
difference  between  the  model  in  Figure  4.15  and  the  model  in  Figure  4.14, 
which  is  equivalent  to  a  test  of  whether  the  structural  coefficients, 
y^l»  )^2»  an^  9^21'  representing  causal  connections  among  the  manifest 
exogenous  and  the  latent  variables  of  the  model  in  Figure  4.14,  differ 
significantly  from  zero.  As  before,  the  chi  square  statistic  to  use  for 
this  test  is  the  difference  chi  square  equal  to  the  difference  between 
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the  goodness-of-fit  chi  square  statistics  obtained  for  the  models  in 
Figures  4.14  and  4.15,  respectively.  The  difference  chi  square  has  degrees 
of  freedom  equal  to  the  difference  between  the  degrees  of  freedom  of  the 
two  goodness-of-fit  chi  squares.  Again,  if  the  difference  chi  square  test 
of  the  difference  between  these  models  is  significant,  one  must  stop  and 
perform  no  further  significance  testing.  In  that  case  one  accepts  the 
structural  model  in  Figure  4.14. 

If  the  difference  chi  square  test  comparing  the  models  in  Figures  4.14 
and  4.15  is  not  significant,  then  one  can  make  one  final  test.  This  is  a 
test  of  the  pathways  from  the  latent  variables  to  the  observed  variables. 

Do  their  structural  parameters  all  simultaneously  equal  zero?  If  they  do, 
then  there  should  be  no  structure  among  the  observed  variables.  Thus  one 
would  test  the  difference  between  the  model  in  Figure  4.15  and  the  model  in 
Figure  4.16.  (Actually  there  is  the  possibility  for  intermediate  models 
between  the  four  independent  clusters  model  in  Figure  4.15  and  the 
no-structure-among-variables  model  in  Figure  4.16:  One  might  have  a  model 
in  which  just  the  third  common  factor  is  eliminated,  leaving  variables 
111/  .*.'^14  mutu0lly  uncorrelated  as  well  as  uncorrelated  with  the 
other  manifest  variables.  A  more  restricted  model  would  eliminate  the 
second  and  third  factors,  leaving  the  last  nine  variables  mutually 
uncorrelated  as  well  as  uncorrelated  with  the  firsc  three  variables,  which 
are  assumed  to  have  a  single  common  factor)  .  As  before,  one  performs  a  chi 
square  test  based  on  the  difference  between  the  goodness-of-fit  chi 
squares  of  the  models  in  Figures  4.15  and  4.16,  respectively.  If  one 
rejects  the  null  hypothesis  (that  the  no-structure  model  in  Figure  4.16 
is  correct) ,  one  accepts  the  previous  model  of  Figure  4.15. 
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The  Normed  Fit  Index.  There  is  one  serious  drawback  to  the  use  of 
the  chi  square  goodness-of-fit  statistic.  Just  at  that  point  inhere  the 
sample  size  becomes  large  enough  so  that  the  goodness-of-fit  statistic 
becomes  distributed  with  reasonable  approximation  to  the  chi  square 
distribution,  it  also  has  sufficient  power  to  detect  even  minute 
departures  of  the  data  from  the  model.  One  will  find  with  real-world  data 
that  he  or  she  rejects  the  fit  of  the  model  almost  every  time,  even  when 

A 

an  examination  of  the  residual  matrix  (S  -  Z_)  reveals  most  elements 
of  this  matrix  are  small  in  magnitude.  On  the  other  hand,  Bentler  and 
Bonnett  (1980)  point  out  that  if  one  uses  this  statistic  with  small 
samples,  he  or  she  will  almost  always  accept  the  fit  of  a  model  to  the 
data  for  almost  any  model.  Consequently,  one  would  hope  to  find  a  way  of 
assessing  how  well  the  model  captures  most  of  the  information  in  the 
sample  covariance  matrix  that  is  independent  of  sample  size. 

Bentler  and  Bonnet  (1980)  proposed  an  index  they  call  the  normed  fit 
index.  This  index  gives  the  relative  decrease  in  lack  of  fit  between 
two  nested  models,  one  less  restricted  than  the  other.  The  decrease  in 
lack  of  fit  in  going  from  the  more  restricted  to  the  less  restricted  model 
is  compared  to  a  norm.  This  jiorm  is  defined  on  a  nested  sequence  of 
increasingly  less  restricted  models  that  begins  with  a  most  restricted 
"null"  model,  is  followed  by  the  original  two  nested  models,  and  ends  with 
a  just  identified  model.  The  norm  for  comparison  is  the  decrease  in  lack 
of  fit  in  going  from  the  "most  restricted"  model  at  one  end  of  the 
sequence  to  a  just  identified  model  (vrtiich  necessarily  has  no  lack  of  fit) 
at  the  other  end  of  the  sequence.  The  most  restricted  model  at  the  most 


restricted  end  of  the  sequence  would  fix  all  free  parameters  not  fixed  in 
the  other  models  in  the  nested  sequence.  In  many  (but  not  all)  instances 
the  most  restricted  model  is  the  "null"  model  that  predicts  no  relations 
between  any  of  the  manifest  variables.  In  other  words,  under  this  most 
restricted  "null"  model,  the  variance-covariance  matrix  of  the  observed 
variables  would  be  hypothesized  to  be  a  diagonal  matrix,  with  all  off 
diagonal  elements  equal  to  zero. 

For  exanple,  consider  the  nested  sequence  of  models  in  going  from  the 
model  in  Figure  4.16  down  to  the  model  in  Figure  4.12.  The  model  in 
Figure  4.16  is  the  "null"  model  of  this  sequence.  Figure  4.12  displays 
the  just  identified  model.  The  difference  in  lack  of  fit  between  the  null 
model  in  Figure  4.16  and  the  just  identified  model  in  Figure  4.12 
represents  the  maximum  possible  difference  in  lack  of  fit  between  any  two 
models  in  this  sequence  of  nested  models.  And  so,  we  can  use  this  maximum 
possible  difference  in  lack  of  fit  as  a  norm  against  which  the  difference 
in  lack  of  fit  between  any  two  models  within  the  sequence  can  be  compared. 

In  testing  the  fit  of  the  null  model,  the  goodness~of-fit  statistic  is 
denoted  and  is  based  on  a  comparison  of  the  differences  between  the 
zero  covariances  of  the  null  model  and  the  actually  observed  covariances 
in  the  sample  covariance  matrix  S.  Now,  designate  the  most  restricted 
"null"  model  From  the  pair  of  models  in  question  let 

denote  the  least  restricted  model,  that  is,  the  model  that  has  the 
fewest  fixed  parameters  (and  yet  is  still  overidentified) .  Let 
denote  a  more  restricted  submodel  of  M^,  in  that  has  not  only 
the  same  fixed  parameters  of  but  other  fixed  parameters  as  well, 
meaning  it  has  fewer  free  parameters  to  estimate.  Then  the  normed  fit 
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index  is  given  by 

?ki  *  <4  -Q'lo  (i ■ 271 

where  _F  is  any  fit  function  (LS,  ML,  GLS)  evaluated  for  the  respective 
models.  (The  index  works  with  any  method  of  estimation  using  a  fit 
function  which  is  to  be  minimized  conditional  on  the  fixed  parameter 
values  and  which  takes  the  value  of  zero  when  fit  is  perfect.  Even  the 
LS  fit  function  can  be  used  since  probabilistic  inference  will  not  be 
involved  in  the  use  of  the  normed  fit  index) . 

Now,  a  special  case  of  the  index  given  in  Eq.  4.27  would  be  a 
comparison  of  the  lack  of  fit  of  a  given  model  to  the  lack  of 
fit  of  the  null  model  which  is  given  by 

301=  <Zo-!y/Io  u-28’ 

If  this  index  is  close  to  unity,  then  the  model  M^  has  captured 
most  of  the  information  about  relationships  between  the  observed  variables 
as  given  in  the  sample  variance-covariance  matrix.  This  index  may  be  used 
even  if  the  goodness-of-fit  test  for  using  the  chi-square  statistic 
is  significant,  meaning  a  statistical  lack  of  fit.  In  other  words,  the 
normed  fit  index  provides  a  nonstatistical  assessnent  of  the  adequacy  of  a 
model's  fit  to  data,  which  may  be  used  to  determine  whether,  on  practical 
grounds,  a  model  may  be  of  value  in  describing  a  particular  set  of  data. 

However,  a  drawback  of  the  normed  fit  index  is  that  the  index  takes 
no  account  of  the  reduction  in  degrees  of  freedom  in  going  from  the  null 
model  to  the  model  whose  fit  is  being  assessed.  For  example,  a  just 
identified  model  wath  zero  degrees  of  freedom  would  have  a  normed  fit 
index  of  mity.  And  a  barely  over  identified  model  with  just  a  few  degrees 
of  freedom  relative  to  the  degrees  of  freedom  of  the  null  model  would 
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typically  have  a  normed  fit  index  of  near  unity.  Cne  must  ask  how 
efficient  is  the  increase  in  fit  in  going  from  the  null  model  with  many 
:  degrees  of  freedom  to  another  model  with  just  a  few  degrees  of  freedom  in 

terms  of  the  degrees  of  freedom  lost  in  estimating  more  parameters? 

Khattab  and  Hocevar  (1982)  recommend  the  efficiency  index 

5=  l?0  -  4>/[Eo(^  *4"  (4-29) 

which  represents  the  per  paraneter  (or  degree  of  freedom)  average  increase 
in  fit,  where  d^  are  the  degrees  of  freedom  of  the  null  model,  d^ 
are  the  degrees  of  freedom  of  the  model  whose  fit  to  the  data  is  to 
be  assessed,  and  F^  and  are  defined  as  before. 

However,  the  efficiency  index  in  (4.29)  is  not  useful  for  making 
comparisons  between  models  in  general.  One  would  like  to  consider  how 
well  a  model  fits  data  in  terms  of  the  parsimony  of  the  model.  The 
parsimony  of  a  model  is  indicated  by  the  ratio  of  the  number  of  degrees  of 
freedom  in  the  model  to  the  number  of  degrees  of  freedom  available  in  the 
data  as  indicated  by  the  degrees  of  freedom  of  the  null  model  for  that 
data.  When  this  ratio  is  unity  then  all  information  in  the  data  available 
for  testing  the  null  model  is  also  available  for  testing  the  alternative 
model  in  question.  No  information  is  wasted  on  estimating  parameters. 

For  example,  consider  two  models,  and  Let  be  a 

model  applied  to  10  variables,  while  is  applied  to  30  variables. 

The  number  of  degrees  of  freedom  d^  in  the  null  model  for  the  10 
variable  case  is  (10  x  ll)/2  -  10  =  45.  The  number  of  degrees  of  freedom 
d^2  for  the  null  model  of  the  case  with  30  variables  is  (30  x  31)/2  - 
30  =  435.  Suppose  the  degrees  of  freedom  d^  of  the(  10  variable  model 
is  35.  Suppose  the  degrees  of  freedom  d2  of  "he  model  M2 
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typically  have  a  normed  fit  index  of  near  unity.  One  must  ask  how 
efficient  is  the  increase  in  fit  in  going  from  the  null  model  with  many 
degrees  of  freedom  to  another  model  with  just  a  few  degrees  of  freedom  in 
terms  of  the  degrees  of  freedom  lost  in  estimating  more  parameters? 

Khattab  and  Hocevar  (1982)  recommend  the  efficiency  index 
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which  represents  the  per  parameter  (or  degree  of  freedom)  average  increase 
in  fit,  where  d^  are  the  degrees  of  freedom  of  the  null  model,  d^ 
are  the  degrees  of  freedom  of  the  model  whose  fit  to  the  data  is  to 
be  assessed,  and  Fg  and  are  defined  as  before. 

However,  the  efficiency  index  in  (4.29)  is  not  useful  for  making 
comparisons  between  models  in  general.  One  would  like  to  consider  how 
well  a  model  fits  data  in  terms  of  the  parsimony  of  the  model.  The 
parsimony  of  a  model  is  indicated  by  the  ratio  of  the  number  of  degrees  of 
freedom  in  the  model  to  the  number  of  degrees  of  freedom  available  in  the 
data  as  indicated  by  the  degrees  of  freedom  of  the  null  model  for  that 
data.  When  this  ratio  is  unity  then  all  information  in  the  data  available 
for  testing  the  null  model  is  also  available  for  testing  the  alternative 
model  in  question.  No  information  is  wasted  on  estimating  parameters. 

For  example,  consider  two  models,  and  M2>  Let  be  a 

model  applied  to  10  variables,  while  m2  is  applied  to  30  variables. 

The  number  of  degrees  of  freedom  d^  in  the  null  model  for  the  10 
variable  case  is  (10  x  ll)/2  -  10  =  45.  The  number  of  degrees  of  freedom 
dg2  for  the  null  model  of  the  case  with  30  variables  is  (30  x  31)/2  - 
30  =  435.  Suppose  the  degrees  of  freedom  d^  of  tire  10  variable  model 
is  35.  Suppose  the  degrees  of  freedom  d.,  of  the  model  M2 


is  425.  Both  models  differ  from  their  respective  null  models  by  10 


degrees  of  freedom.  Yet  would  have  a  parsimony  of  425/435  =  .977 
vrtiile  the  parsimony  of  would  be  25/35  =  .71.  If  and 
both  had  the  same  value  for  the  normed  fit  index,  they  would  both  have  the 
same  efficiency  index  as  well,  but,  because  is  more  parsimonious,  its 
use  of  the  data  is  superior.  Thus,  we  recommend  multiplying  the  normed  fit 
index  by  the  parsimony  of  the  model  to  obtain  the  parsimonious  fit  index: 


<V4)>  tdo-41^ 


(4.30) 


This  index  corrects  for  the  high  normed  fit  index  values  obtained  by 
models  that  waste  data  in  estimating  parameters  to  get  good  fits  to  the 
data.  Note  that  a  just  identified  model  with  zero  degrees  of  freedom 
would  obtain  a  zero  parsimonious  fit  index  value  even  when  its  normed  fit 
index  value  is  unity. 

We  will  now  illustrate  the  use  of  the  normed  fit  indices  as  well  as 
estimation,  significance  testing,  and  nested  models  in  an  example  of  a 
structural  model  with  latent  variables. 


Example.  What  follows  is  a  series  of  analyses  based  on  an 
artificially  generated  sample  variance-covariance  matrix.  This  matrix  was 
designed  to  simulate  data  obtained  from  a  sample  of  800  observations  on  13 
manifest  variables  having  the  structural  relationships  of  the  model  in 
Figures  4.9  and  4.14.  This  sample  variance-covariance  matrix  is  shown  in 
Table  4.  3. 

Our  first  task  will  be  bo  evaluate  the  "measurenent  model"  part  of  the 
model  in  Figure  4.9  and  Figure  4.14.  In  this  phase  of  our  analysis  we  will 
attempt  to  answer  questions  such  as  the  following:  Have  we  selected 


Insert  Table  4 . 3  about  here 
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variables  that  behave  as  indicators  of  latent  variables  in  the  manner  that 

the  manifest  variables  of  the  model  in  Figure  4.9  and  Figure  4.14  do?  In 

other  words,  are  manifest  variables^'  ••••  aH  related  to  a 

single  canmon  latent  variable  or  factor?  Can  we  say  the  same  thing  about 

variables^,  . ..,  ^  or  variables  ...,^4?  Is 

variable^  uncorrelated  with  variables  ••  *»X]_4?  l^ese 

questions  can  be  answered  by  testing  the  model  shown  in  Figure  4.13.  To  do 

this  we  will  now  seek  maximun-likelihood  estimates  of  the  parameters  of 

the  model  in  Figure  4.13.^ 


Insert  Footnote  6  about  Here 


The  maximun  likelihood  estimates  along  with  the  estimated  standard 
errors  of  the  free  parameters  of  the  model  in  Figure  4.13  are  shown  in  the 
structural  diagram  in  Figure  4.17.  The  chi  square  test  for  goodness  of  fit 
with  61  degrees  of  freedom  yields  a  value  of  57.4074.  lhe  probability  of 
getting  a  chi  square  statistic  this  large  or  larger  with  61  degrees  of 
freedom  is  .6069.  Hence  the  goodness-of-fit  statistic  is  not  significant 
at  the  .05  level,  and  we  can  accept  the  model  as  consistent  with  the  data. 


Insert  Figure  4.17  about  Here 


Having  accepted  a  model  in  which  the  variables  are  divided  into 
homogeneous  clusters  in  the  expected  manner,  we  now  can  focus  on  the 
structure  among  the  latent  variables  of  that  model.  We  will  assume 
that  according  to  theory  the  latent  variables  of  the  model  in  Figure 
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4.17  have  a  structure  like  that  of  Figure  4.9  or  Figure  4.14.  In 
specifying  this  model  we  will  have  to  fix  the  values  of  the  two  structural 
parameters  relating  the  disturbance  variables  and  to  their 
corresponding  latent  endogenous  variables  ^  and  ^  just  so  the  metric 
of  these  endogenous  variables  will  be  determined.  In  Figure  4.18  we  show 
the  structural  diagram  of  this  model  again,  but  this  time  with 
max  imun-liKeli  hood  estimates  and  estimated  standard  errors  of  the  values  of 
the  structural  parameters  of  this  model.  The  chi  square  statistic  for  this 
model  has  64  degrees  of  freedom  and  a  value  of  57.6570.  Ihe  probability  of 
getting  a  chi  square  with  64  degrees  of  freedom  this  large  or  larger  is 
.6988.  Again  the  ch  square  statistic  is  not  significant  at  the  .05  level, 
and  so  we  continue  to  accept  the  model.  The  difference  chi  square  for  the 
comparison  of  the  models  in  Figures  4.13  and  4.14  is  57.6570  -  57.4074  = 

.2496  with  64  -  61  =  3  degrees  of  freedom.  This  is  not  significant.  Hence 
the  specification  of  the  causal  structure  among  the  exogenous  and  latent 
endogenous  variables  is  consistent  with  the  data. 

Insert  Figure  4.18  about  Here 

Because  the  chi  square  goodness-of-fit  statistic  is  not  significant, 
we  may  now  test  hypotheses  about  various  free  parameters  of  the  model 
in  Figure  4.18.  Ctie  way  we  could  do  this  at  this  point  is  to  perform 
confidence  interval  tests  of  whether  the  various  free  parameters  of  this 
model  are  equal  to  zero  or  not.  For  example,  we  may  wish  to  test  the 
hypothesis  that  the  structural  parameter  relating  the  first  latent 
endogenous  variable  to  the  second  latent  endogenous  variable  is  equal  to  zero. 
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The  maxim  un-likelihood  estimate  of  CL^  is  equal  to  .729  and  has  a  standard 
error  of  .054.  An  approximate  95%  confidence  interval  estimate  of  is 
thus  [  .729  -  2 (.054)  <  Ct^  <  .729  +  2 (.054)]  or  (.611  <  <  .837]. 

Since  the  interval  does  not  include  zero,  we  reject  the  hypothesis  that 
is  equal  to  zero  (cf.  Jflreskog  &  Sfirbom,  1979).  In  Figure  4.18 
the  standard  errors  of  each  of  the  free  structural  parameters  of  the  model 
are  indicated  in  parentheses  next  to  the  corresponding  maximum  likelihood 
estimates  of  these  parameters.  A  test  of  this  form  could  be  performed  for 
each  free  parameter  in  turn.  However,  the  tests  in  such  a  sequence  of 
tests  will  not  be  independent  and  will  have  an  unknown  bias  after  the  first 
test. 

Instead  of  performing  at  this  point  a  series  of  confidence  interval 
tests  of  the  free  parameters,  we  may  also  proceed  with  the  remainder  of  the 
a  priori  sequence  of  tests  described  earlier.  The  remaining  tests  will 
test  hypotheses  about  select  free  parameters  of  the  model  in  Figure  4.14 
and  4.18  using  difference-chi-square  statistics.  Tb  begin  with  consider 
the  model  in  Figure  4.15.  This  is  a  model  nested  within  the  previous  model 
in  Figure  4.14  in  which  the  structural  parameters  pertaining  to  the 
relationships  between  the  two  exogenous  and  the  two  latent  endogenous 
variables  (i.e.,  -21^  ^ave  *3een  fixed  to  zero.  All 

other  fixed  parameters  are  the  same.  For  the  model  in  Figure  4.15,  the  chi 
square  goodness-of-fit  statistic  with  67  degrees  of  freedom  is  equal  to 
1120.5375,  which  is  significant  at  the  .05  level.  The  difference  chi 
square  between  this  model  (Figure  4.15)  and  the  previous  model  (Figure  4.14 
or  4.18)  is  1061.8805  =  1120.5375  -  57.675,  with  3  =  67  -  64  degrees  of 
freedom,  which  is  significant  at  the  .05  level.  This  means  that  fixing  to 
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zero  the  3  additional  paraneters  pertaining  to  causal  relationships  among 
the  two  exogenous  and  the  two  latent  endogenous  variables  produces  a 
definite  lack  of  fit  of  the  model  to  the  data.  Hence  there  must  be  nonzero 
relationships  between  these  variables  as  specified  by  the  causal 
connections  hypothesized  between  them. 

Because  the  chi  square  statistic  for  goodness  of  fit  of  the  model 
in  Figure  4.15  is  significant,  we  cannot  proceed  with  any  further 
statistical  tests  for  the  fit  of  further  restricted  models  nested  within 
the  previous  models.  They  would  all  be  rejected.  We  thus  end  up  accepting 
the  structural  equation  model  repeated  in  Figures  4.9,  4.14  and  4.18  as 
adequately  fitting  the  thirteen  variables. 

However,  we  may  further  wish  to  apply  the  normed-fit  index  to  the 
model  in  Figure  4.18.  lb  obtain  this  value  we  first  must  obtain  the  value 
of  the  chi  square  goodness-of-fit  statistic  for  a  "null"  model  that 
proposes  that  the  covariances  between  different  variables  of  the  set  of  13 
manifest  variables  are  all  zero.  In  other  words,  the  hypothetical  model 
variance-covariance  matrix  is  presumed  to  be  a  diagonal  matrix  with 
zeros  in  its  off-diagonal  positions.  This  would  correspond  to  the  model 
illustrated  in  the  structural  diagram  in  Figure  4.16.  In  this  case  the 
value  of  the  chi  square  goodness-of-fit  statistic  with  78  degrees  of 
freedom  is  computed  to  be  5029.1802,  which  is  significant  at  the  .05  level. 
But  the  normed  fit  index  for  the  model  in  Figure  4.18  is 
(5029.1802  -  57. 675) /5029. 1802  =  .9885 
with  an  associated  reduction  in  degrees  of  freedom  in  going  from  the  null 
model  to  the  structural  equation  model  of 
(78  -  64)  =  14. 


The  efficiency  index  for  the  model  in  Figure  4.18  is  .9885/14  =  .0706. 
The  parsimonious  fit  index  is  (64/78)  x  .9885  =  .811.  These  results  mean 


we  have  gained  considerable  improvement  in  fit  at  the  expense  of  very 
little  loss  in  degrees  of  freedcm  in  going  from  the  null  model  to  the 
structural  equation  model  to  account  for  the  relationships  among  the  13 
manifest  variables.  This  provides  considerable  support  for  the  model  in 
Figures  4.9,  4.14,  and  4.18. 

Applications. 

A  number  of  latent  variable  models  are  available  in  addition  to  the 
recursive  model  we  used  to  illustrate  latent  variable  designs.  References 
to  a  large  number  of  studies  are  furnished  in  the  Annual  Review  chapter 
on  "Multivariate  Analysis  with  Latent  Variables:  Causal  Modeling,"  by 
Bentler  (1980) .  Che  can  also  find  overviews  of  a  number  of  important 
designs  in  JOreskog  (1978)  and  more  complete  illustrations  in  Jdreskog 
and  Sftrbom  (1979)  .  Of  special  interest  are  models  that  can  be  used  to 
test  assumptions  associated  with  selected  conditions  for  confirmatory 

t 

analysis,  using  latent  variable  designs.  For  example,  an  overview  of 
latent  variable  models  for  testing  reciprocal  causation  (Condition  4)  was 
presented  recently  by  Maruyama  and  Me Garvey  (1980)  .  [Unfortunately,  these 
authors  made  the  unwar  rented  assumption  that  disturbances  for  reciprocally 
related,  latent  endogenous  variables  were  unrelated.)  Tests  for  moderators, 
which  help  to  specify  boundaries  for  theory  (Condition  6) ,  have  been 
addressed  by  Werts,  Rock,  Linn,  and  Jflreskog  (1976),  Werts,  Rock,  and 
Grandy  (1979,  and  Werts,  Rock,  and  Linn  (1979).  The  question  of  stability 
of  structural  models  (Condition  7)  has  been  investigated  in  growth  and 


1 


t 


193 

longitudinal  studies  (JSreskog,  1979;  SSrbom,  1979;  Werts,  Jflreskog, 

&  Linn,  1972).  Finally,  questions  concerning  the  operationalization  of 
variables  and  constructs  (Condition  8),  such  as  method  variance  and  other 
forms  of  nonrandom  measurement  error,  have  received  considerable  attention 
in  the  latent  variable  literature  (cf.  Alwin  &  Jackson,  1980;  Kalleberg  & 
Kleugel,  1975;  Mulaik,  1975;  Mellenbergh,  Kelderman,  Stijlen  &  Zondag, 

1979;  Werts,  Jflreskog  &  Linn,  1976;  Werts,  Breland,  Grandy,  &  Rock,  1980). 
The  Conditions  for  Causal  Model ing  with  Latent  Variable  Models 

By  now  it  should  be  clear  that  to  do  confirmatory  analysis  with  latent 
variable  structural  models  one  still  has  to  satisfy  the  ten  conditions  for 
confirmatory  analysis  described  earlier  (Section  2)  for  models  with 
manifest  variables.  So  far,  our  discussion  of  latent  variable  models  has 
focused  on  methodological  procedures  for  determining  whether  structural 
parameters  associated  with  hypothesized  causal  connections  between 
variables  are  indeed  not  equal  to  zero  (Condition  9)  and  whether  the 
specifications  of  fixed  parameters  in  the  identification  of  the  model  are 
consistent  with  the  data  (Condition  10).  We  also  furnished  references 
regarding  models  and  tests  for  causal  direction  (Condition  4),  moderators 
(Condition  6),  stability  and  stationarity  (Condition  7),  and  construct 
validity  (Condition  8).  This  might  suggest  that  researchers  using  latent 
variable  models  have  addressed  conditions  for  confirmatory  analysis  in  a 
meaningful  and  thorough  manner.  However,  such  has  not  always  been  the 
case.  It  is  our  impression  that  many  of  the  conditions  have  been  violated, 
often  seriously,  in  published  confirmatory  analysis  using  latent  variables. 
We  shall  review  below  our  more  serious  concerns. 
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Violations  of  Conditions  _1,  2,  and  5_.  We  have  observed  numerous 
studies  with  violations  of  the  self-containment  condition  (Condition  5) , 
where  obvious  relevant  causal  variables  have  been  omitted  from  latent 
variable  structural  models.  These  omissions  of  relevant  causes  have  seaned 
not  to  be  due  to  a  dearth  of  knowledge  or  theory  available  to 
investigators  in  the  field,  but  due  to  a  superficial  regard  for  theory  and 
available  knowledge  (violations  of  Conditions  1  and  2)  accompanying  the 
rush  to  implement  a  new  methodology.  In  fact,  violations  of  Conditions  1 
and  2  may  be  the  root  of  most  misuses  of  confirmatory  methods  with  latent 
variables,  and  for  us  this  is  evidenced  by  the  many  studies  in  the 
literature  in  which  little  or  no  effort  has  been  made  to  provide 
theoretical  justification  for  the  model  to  be  analyzed.  It  is  time  that 
theoretical  concerns  receive  at  least  equal  emphasis  as  methodological 
concerns  in  latent  variable  analysis. 

Mi sinterpr etations  of  disturbance  variables  (disturbance  terms) . 
Another  common  mistake  is  to  short-circuit  the  distinction  between 
disturbance  variables  and  residual  variables.  Disturbance  variables  are 
not  "residual  variables",  that  is,  variables  representing  variation  in  the 
endogenous  variables  after  effects  due  to  preceding  causal  variables  in 
functional  equations  have  been  parti ailed  o^t  in  the  estimation  process. 

In  this  case  residual  variables  are  always  uncorrelated  with  causal 
variables  in  structural  equations  because  of  the  mathematics  of 
partial ling.  As  with  manifest  variable  models,  the  correct  way  to  view 
disturbance  variables  is  to  regard  them  as  causal  effects  on  the  endogenous 
variables  due  to  such  things  as  omitted  relevant  causal  variables,  random 
shocks,  and  misspecifications  of  equations  (e.g.  nonl inearity) . 
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The  extraneous  causal  influences  acting  on  an  endogenous  variable  may 
or  may  not  be  correlated  with  the  causal  variables  included  explicitly  in 
the  functional  equation  for  an  endogenous  variable.  If  in  a  functional 
equation  the  disturbance  variable  is  correlated  with  the  causal  variables 
(for  example,  the  disturbance  includes  relevant  causes)  ,  then  the 
functional  equation  and  the  system  of  functional  equations  will  not  be 
self-contained  (i.e.  a  violation  of  Condition  5),  and  it  will  be 
impossible  to  determine  in  a  consistent  way  the  effects  of  the  causal 
variables  on  the  endogenous  variables.  Thus,  we  must  have  good  reasons  to 
assume  that  the  disturbance  variables  are  uncorrelated  with  the  causal 
variables  in  functional  equations.  In  other  words,  we  must  know  of  no 
plausible  unmeasured  extraneous  variable  that  acts  upon  an  endogenous 
variable  and  at  the  same  time  correlates  with  the  causal  variables  of  the 
endogenous  variable.  If  this  is  the  case,  then  in  the  estimation  process 
residual  variables  can  be  tentatively  identified  with  disturbance 
var iables. 

In  a  related  vein,  another  frequent  mistake  with  latent  variable 
models  is  to  presume  without  prior  justification  that  the  disturbance 
variables  contain  only  error  of  measurement.  Often  the  variances  of  the 
disturbance  terms  are  interpreted  as  "error  variances"  and  used  to  estimate 
the  reliability  of  certain  manifest  variables.  (In  general  these  should  be 
biased  estimates  at  best) .  But  as  we  have  already  indicated,  the 
disturbances  may  contain  true  variance  that  is  associated  with  omitted 
causal  variables  not  included  in  the  study.  Another  misinterpretation  is 
to  assume  that  the  disturbance  variables  are  like  unique  factors  in  common 
factor  analysis,  which  are  always  mutually  uncorrelated.  But  it  is  quite 
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possible  to  have  correlated  disturbance  variables  because  omitted,  relevant 
causal  variables  represented  in  different  disturbances  can  be  correlated. 
Most  estimation  algorithms  allow  one  to  estimate  these  correlations,  but 
such  estimation  is  meaningful  only  in  certain  types  of  designs,  such  as 
nonrecursive  and  time-series  models. 

In  summary,  disturbance  variables  are  often  misunderstood  and 
misinterpreted  in  latent  variable  models,  lb  a  great  extent  this 
appears  to  be  again  attributable  to  the  lack  of  attention  to  Conditions 
1,  2,  and  5. 

Exploratory  rather  than  confirmatory  analysis.  A  serious 
misuse  of  the  methodology  of  confirmatory  analysis  with  latent  variables  is 
to  "play"  with  fixing  and  freeing  the  elements  of  the  parameter  matrices. 
That  is,  a  researcher  may  Conduct  numerous  analyses  with  different 
fixed  and  free  parameters  until  a  model  is  fornd  that  best  fits  the  data. 

It  is  also  not  unknown  for  some  researchers  to  change  assumptions 
regarding  causal  direction  for  the  same  purpose.  This  is  nothing  but 
exploratory  analysis  and  does  not  represent  a  confirmatory  analysis. 

Stated  simply  a  confirmatory  analysis  tests  only  a  priori  models 
vtoich  have  a  strong  theoretical  base.  This  does  not  preclude,  however, 
testing  several  a  pr io r i  models  on  the  same  sample. 

Fail  ure  to  over  identify  parameters.  Another  serious  shortcoming  of 
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many  confirmatory  analyses  using  both  manifest  and  latent  varj^f)les  is  to 
fail  to  over identify  the  structural  parameters  of  the  measurement  portion 
of  a  model  concerning  the  connection  between  latent  variables  and  their 
manifest  indicators.  As  mentioned  earlier  in  this  section, 
over  identification  is  essential  if  one  is  to  specify  a  model  in  ways  in 
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which  it  can  be  potentially  falsified.  As  it  most  often  happens,  failure 
to  overidentify  parameters  occurs  when  a  researcher  selects  only  three 
manifest  indicators  to  represent  a  latent  variable.  'While  it  is  true  that 
the  structural  coefficients  relating  three  manifest  indicators  to  a  latent 
variable  are  identified,  in  most  cases  they  are  not  overidentified  in 
models  containing  these  indicators.  As  a  result  it  is  not  possible  to  test 
whether  or  not  the  three  indicators  have  a  single  latent  variable  in 
common,  since  one  can  always  fit  a  single  latent  variable  perfectly  to 
three  manifest  variables.  Thus,  a  crucial  test  of  the  "measurement  model" 
part  of  a  structural  model  with  latent  variables  may  not  be  possible. 
Therefore,  we  recommend  that  each  latent  variable  in  a  model  be  represented 
by  at  least  four  manifest  indicator  variables.  In  this  case  the  structural 
parameters  of  the  "measurement"  submodels  relating  each  set  of  manifest 
indicators  to  their  respective  latent  variables  will  be  overidentified,  and 
it  will  be  possible  to  test  the  measurement  aspects  of  the  model. 

These  are  our  most  serious  concerns  with  respect  to  the  use  of  latent 
variables  in  confirmatory  analysis.  They  are  by  no  means  exhaustive  of  all 
the  concerns  in  this  area.  In  fact,  many  of  the  issues  we  raised  in  regard 
to  manifest  variable  models,  such  as  the  need  for  equilibrium- type 
conditions  and  specification  of  causal  intervals  in  time-series  analysis, 
apply  equally  to  latent  variable  models.  We  will  not  attempt  to  document 
all  of  these  issues  again  for  latent  variable  models.  Rather,  we  suggest 
that  the  users  of  the  latent  variable  approach  have  been  at  least  as 
cavalier  as  users  of  the  manifest  variable  approach  in  regard  to 
theoretical  justifications  for  using  confirmatory  analysis.  We  recommend 
strongly  that  theoretical  issues  be  given  greater  attention  in  latent 
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variable  designs. 

Causal  Inference  with  Latent  Variables 

There  are  dangers  and  difficulties  for  causal  inference  in  using  the 
latent  variable  concept,  because  it  implies  explanation  in  terms  of  unseen, 
not  directly  measureable  processes  and/or  entities.  The  danger  is  that 
when  a  model  with  latent  variables  appears  consistent  with  data,  one  may 
fail  to  entertain  the  possibility  of  alternative  models  for  the  same 
data.  Latent  variables  can  never  be  regarded  as  representing  unique 
explanations  of  any  phenomenon.  This  is  especially  to  be  remembered  in 
latent  variable  models  because  the  number  of  latent  variables,  including 
latent  disturbances,  will  always  exceed  the  number  of  manifest  variables. 

As  a  consequence,  the  latent  variables  can  never  be  uniquely  determined 
from  the  manifest  variables;  what  is  manifest  and  seen  cannot  uniquely 
define  what  is  latent  and  unseen.  This  result  is  now  well  knov*i  for  the 
latent  variable  model  of  exploratory  common  factor  analysis  (cf.  Mulaik 
&  McDonald,  1978;  McDonald  &  MLilaik,  1979  for  reviews  of  this  problem 
in  factor  analysis) .  But  the  result  also  generalizes  from  common  factor 
models  to  structural  equation  models  with  latent  variables. 

There  are  two  important  aspects  to  the  problem  of  indeterminacy  for 
latent  variables;  (1)  scores  on  latent  variables  are  not  uniquely  defined. 
(2)  the  identity  of  latent  variables  is  not  uniquely  determined  from  the 
manifest  variables  dependent  on  them. 

When  a  structural  equation  model  with  latent  variables  fits  a  set  of 
data,  one  may  naturally  want  to  have  a  way  to  obtain  measurements  on  the 
latent  variables.  Borrowing  the  factor  score  concept  from  factor  analysis. 
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some  have  tried  to  obtain  these  measurements  by  regressing  the  latent 
variables  onto  the  manifest  variables,  using  information  about  the 
covariances  among  the  manifest  variables,  and  between  the  manifest 
variables  and  the  latent  variables,  as  the  basis  for  the  regression 
estimates.  However,  the  regression  estimates  of  the  latent  variables  are 
not  equivalent  to  the  latent  variables,  differing  from  them  by  an 
indeterminate  amount  whose  variance  is  equal  to  the  error  of  estimate  in 
estimating  the  latent  variable  from  the  manifest  variables.  It  is  the 
unpredictable  part  of  the  latent  variable  that  is  indeterminate.  There  is 
no  unique  solution  for  this  part.  Hence,  one  can  construct  more  than  one 
solution  for  the  latent  variable  by  combining  the  predictable  part — 
obtained  from  the  regression  of  the  latent  variable  onto  the  manifest 
variables — with  a  variable  independent  of  the  predictable  part,  chosen 
arbitrarily  to  stand  for  the  unpredictable  part.  When  the  squared  multiple 
correlation  for  predicting  a  latent  variable  is  less  than  .70,  then  quite 
possibly  two  alternative  constructed  solutions  for  the  same  latent  variable 
can  be  correlated  not  just  zero  but  negatively  with  one  another.  The  use 
of  constructed  or  estimated  "factor  scores"  to  stand  for  a  latent  variable 
in  question  is  not  recommended  when  the  multiple  correlation  for  predicting 
a  latent  variable  is  less  than  .80.  Wien  the  multiple  correlation  is  less 
than  .80,  such  scores  may  show  moderate  to  strong  correlations  with  other 
variables  vrtiile  possibly  the  real  latent  variable  in  question  has  only  zero 
to  moderate  correlations  with  these  same  other  variables. 

More  problematic  is  the  indeterminacy  in  the  identity  of  latent 
variables  in  structural  equation  models,  lb  be  sure,  a  researcher  performs 
a  confirmatory  analysis  with  an  interpretation  already  given  to  the  latent 
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variables  of  the  analysis.  If  the  model  is  discontinued  because  the  latent 
variables  do  not  relate  to  their  respective  manifest  indicators  as  expected 
by  hypothesis,  then  one  may  question  either  the  choice  of  indicators,  or 
more  seriously,  the  interpretations  attached  to  the  latent  variables  of  the 
model  that  led  one  to  believe  that  the  manifest  variables  would  be  linked 
together  through  the  latent  variables. 

But  what  can  one  conclude  if  the  model  is  corroborated  by  showing 
acceptable  fit  to  the  manifest  variables?  Is  the  model  corroborated 
because  the  manifest  variables  have  as  causes  the  latent  variables  presumed 
at  the  outset  to  be  causes  of  them?  Or  is  the  model  corroborated  because 
the  manifest  variables  have  some  other  variables  as  causes?  In  particular, 
could  there  be  other  variables  having  the  same  pattern  of  relationships 
with  the  manifest  variables  as  one's  hypothetical  latent  variables,  but 
distinct  from  them?  The  mathematics  of  indeterminacy  for  latent  variables 
allows  for  this.  But  there  is  no  way  to  resolve  such  questions  other  than 
to  try  to  make  explicit  all  possible  alternative  interpretations  for  the 
results  and  then  perform  further  studies  to  differentiate  between  the 
interpretations.  But  the  indeterminacy  of  latent  variables  is  not  a  fatal 
flaw  of  latent  variable  models,  it  is  a  reflection  of  the  indeterminacy  in 
all  inductive  attempts  to  use  empirical  observations  to  confirm  theories  by 
examining  whether  consequences  deduced  from  the  theories  do  indeed  occur  as 
expected.  This  point  we  emphasized  earlier  in  Section  3. 

We  recognize  that  there  are  those  who  are  uncomfortable  with  unseen 
causes,  latent  variables,  and  hypothetical  constructs.  In  fact,  inspired 
by  the  19th  century  writings  of  the  empiricist  philosophers  Mach  and 
Pearson,  the  followers  of  the  positivist  movement  in  the  philosophy  of 
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science  sought  between  1920  and  1950  to  exclude  from  science  all  reference 
to  inseen  causes,  believing  that  incorrigible  knowledge  could  only  be 
derived  from  sensory  experience  and  hence  from  those  things  that  were 
observed.  Since  the  1950's  the  philosophy  of  science  has  taken  a  more 
pragmatic  and  less  dogmatic  and  absolutistic  position  with  respect  to 
knowledge.  The  result  is  that  the  use  of  "inseen  causes"  is  accepted  now 
with  greater  tolerance,  as  long  as  the  unseen  causes  allow  one  to  unify 
broader  domains  of  experience  than  before  and  are  interpreted  in  terms  of 
categories  that  in  principle  may  eventually  be  observable  or  predict  novel, 
observable  results.  It  is  interesting  to  note  that  Mach  rejected  totally 
atomic  theory  in  physics  because  atoms  were  "too  hypothetical",  while 
Pearson  opposed  the  Mendel ian  theory  of  genes  for  nearly  20  years  after 
Mendel's  work  was  discovered  in  1900,  partly  because  genes  were  not 
observable.  In  a  way  these  scientists'  rejection  of  what  were  at  the  time 
hypothetical  constructs  led  them  to  reject  what  were  to  become  the  next 
major  developments  in  their  respective  fields.  Thus  to  "see"  things 
differently,  and  this  includes  in  some  cases  "seeing"  in  things  what  is  not 
directly  visible  in  them,  may  be  essential  first  steps  to  achieving  better 
understanding  of  the  relationships  between  things  in  our  world. 
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Footnotes 

This  manuscript  was  prepared  for  a  conference  on  Innovations  in 
Methodology  for  Organizational  Research,  sponsored  by  the  Division  of 
Industrial  and  Organizational  Psychology  of  the  American  Psychological 
Association.  Support  for  the  conference  and  the  paper  has  been  provided 
jointly  by  the  Organizational  Effectiveness  Research  Program  of  the  Office  of 
Naval  Research  and  the  School  Management  and  Organization  Studies  Unit  of 
the  National  Institute  of  Education. 

Support  for  this  project  was  also  provided  under  Office  of  Naval  Research 
Contract  N00014-80-C-0315,  Office  of  Naval  Research  Project  M1170-904. 

Opinions  expressed  are  those  of  the  authors  and  are  not  to  be  construed  as 
necessarily  reflecting  the  official  view  or  endorsement  of  the  Department  of 
the  Navy. 

The  authors  wish  to  thank  ftjgh  J.  Arnold,  Joel  T.  Campbell,  Robert  G. 
Demaree,  Robert  L.  Linn,  Jack  McArdle,  S.  B.  Sells,  B.  Krishna  Singh,  and 
Gerrit  Wolf  for  their  helpful  suggestions  and  advice. 

^Theories  of  stress  often  differentiate  between  (a)  qualitative 
overload,  which  refers  to  demands  that  exceed  physical  and/ot  personnel 
resources,  and  (b)  quantitative  overload,  which  refers  to  demands  that  exceed 
available  time  limits  but  not  physical/personnel  resources  (cf.  Katz  & 

Kahn,  1978).  Both  sources  of  overload  are  addressed  in  the  illustration. 

^ftie  conditional  function  has  the  form  of  the  linear  regression 
function  (Lord  &  Novick,  196$t  Other  functions,  such  as  maximun  likelihood 
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functions,  may  be  employed  to  represent  functional  relations  (cf.  Jcfreskog  & 
Sorbcm,  1979). 

Methodology  has  also  been  developed  for  nominal  and  ordinal  scales  of 
measurement  (cf.  Carter,  1971;  Boyle,  1970;  Heise,  1972;  Lyons,  1977;  Spaeth, 
1975) . 

^The  latter  assumption  satisfies  the  "rank  condition"  required  for 
identification,  which  is  beyond  the  mathematical  scope  of  this  treatment.  It 
implies  that  estimates  of  structural  parameters  will  not  be  subject  to  at 
least  severe  bias  resulting  from  multicol linearity  (cf.  Billings  &  Wroten, 
1978;  Johnston,  1972). 

^Points  1  through  3  are  analogous  to  conventional  contrasts  between 
standardized  and  unstandardized  regression  weights,  which  are  not  elaborated 
here.  Decomposition  of  correlations  is  treated  in  Condition  10. 

6In  actuality  the  sample  variance-covariance  matrix  in  Table  4.3  was 
created  in  the  following  way:  First  800  observations  on  the  15  mutual 
incorrelated  independent  variables  of  the  model  in  Figures  4.9  and  4.14  were 
generated  using  a  computer  random  number  generator  that  produces  numbers  that 
are  approximately  normally  distributed  with  means  of  zero  and  standard 
deviations  of  unity.  The  sample  variance-covariance  matrix  for  these  15 
aritifically  generated  variables  was  then  calculated  and  used  as  a  sample 
estimate  $  of  the  matrix  of  covariances  among  the  independent  variables  of  the 

to* 

model  in  Figure  4.14.  Then  after  fixing  the  zero  and  other  fixed  elements  of 
the  matricesA,/1,  and  ^  to  specify  the  model  in  Figure  4.14,  arbitrary  values 
were  then  chosen  for  the  remaining  "free"  parameters  of  these  matrices  and  the 
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resulting  sample  variance-covariance  matrix  vas  derived  using  equations  4.9a 
4.9c.  Thus  the  matrix  in  Table  4.3.  simulates  a  sample  variance-covariance 
matrix  taken  from  a  population  distributed  according  to  the  multivariate 
normal  distribution  consistent  with  the  model  in  Figure  4.14. 

7 

The  estimates  of  the  free  parameters  of  the  models  discussed  in  this 
section  were  obtained  using  the  program  LISREL  IV  (Joreskog  &  Sorbom,  1978). 
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Figure 


Values  on  X:  1,1, 2,2, 3,3, 4, 4, 5, 5, 6, 6 
Values  on  Y:  1,1, 2, 2, 1,1, 3, 3, 1,1, 3, 3 
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1.1.  The  Functional  Relation  of  Variable  Y 
on  Variable  X  in  an  Asymmetric,  Self- 
Contained  System. 
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Figure  2.1.  Formal  Representation  of  a  Structural  Model 
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Figure  2.2.  Nonrecursive  structural  model  relating  police 
violence  and  protestor  violence.  (From  "An 
Introduction  to  the  Logic,  Assumptions,  and  Basic 
Analytic  Procedures  of  Two-Stage  Least  Squares"  by 
L.R.  James  and  B.K.  Singh,  Psychological  Bulletin, 
1978,  85,  1104-1122.  Copyright  1978  by  the  American 
Psychological  Association.  Reprinted  by  permission.) 


Figure  2.3.  cyclical  recursive  structural  model. 
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Figure  2.k.  Illustrations  of  self-contained  and  nonself -contained 
structural  models. 
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Figure  2.5a.  Social  system:  Nonbounded  time  interval 
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Figure  2.5b.  Causal  order  assuming  rapid  equilibration  times 
»  and  an  equilibrium- type  condition 


Figure  2.5.  Causal  order  and  the  equilibrium- type  condition. 


224 


Condition 


Condition 


Condition 


Condition 


Condition 


Condition 


Condition 


Figure  2.6 


Conditions  Pertaining  to  Appropriateness  of  Theoretical 
Models  for  Confirmatory  Analysis 


Formal  statement  of  theory  in  terms  of  a  structural  model. 

Development  of  a  structural  model  that  specifies  variables, 
causal  connections  among  variables,  and  functional  relations 
and  equations  that  relate  each  effect  to  all  of  its 
relevant  causes. 

Theoretical  rationale  for  causal  hypotheses. 

Use  of  theory  to  propose  how  causes  produce  effects  by 
introduction  of  mediating  mechanisms  to  help  to  explain 
nonobvious  covariation  among  variables  and  molar  causal 
connections  among  complex  variables. 

Specification  of  causal  order. 

Hypothesized  order  in  which  variables  occur  naturally 
in  a  system  of  ordered  functional  equations,  given  an 
equilibrium- type  condition  for  cross-sectional  designs 
and  specified  causal  intervals,  stationarity ,  and  an 
equilibrium- type  condition  for  time  series  designs. 

:  Specification  of  causal  direction. 

Hypothesized  direction  of  causation  for  each  causal 
connnection  in  a  structural  model.  The  direction  may 
be  asymmetric,  denoting  a  recursive  causal  relation,  or 
reciprocal,  denoting  a  nonrecursive  causal  relation. 

:  Self-contained  functional  equations. 

The  functional  equation  for  each  effect  (endogenous 
variable)  in  a  structural  model  contains  all  the  relevant 
causes  of  that  effect,  which  is  indicated  by  lack  of 
covariation  between  the  explicitly  measured  causes  in 
an  equation  and  the  disturbance  term  of  that  equation. 

:  Specification  of  boundaries. 

Given  linearity  in  parameters  and  variables,  the 
functional  equations  are  additive  within  the  populations 
(e.g.,  subjects  and  environments)  to  which  inferences  are 
to  be  made . 

Stability  of  structural  model. 

The  values  of  structural  parameters  are  invariant 
(stationary)  over  specified  time  intervals,  and  the  values 
on  variables  representing  events  are  in  an  equilibrium- 
type  condition. 


(A)  Self-Contained  Path  Model 


(B)  Path  Model  with  an 
Unmeasured  Variable 


Figure  2.10.  Illustrations  of  a  self-contained  path  model  and 
a  path  model  with  an  unmeasured  variable 


Equations 
Path  Equations 


Expectations 


for  Figure  2.10a  -  Self-Contained  Path  Model 

z2  =  ^21zl  +  <d2=RS^ 

=  P^l2!  p32z2  ^3=RS) 

E(z2Z^)  =  P2]_E(z^z^)  +  E(RSz^) 

ECz^z^)  =  P21E(z1z1)  +  p^E^z^)  +  ECRSz^) 

E(z3z2)  =  p2^E(z^Z2>  +  P22E(z2z2^  +  E(RSz2^ 
Note:  E(RSz1 )  =  E(RSzx)  =  E(RSz2)  =  0 


r21  =  ^21  (2.11a) 

Normal  Equations  „  ,  „ 

r32  '  P31  +  P32r21 

r32  =  p31r21  +  p32 


(2.11b) 

(2.11c) 


Equations  for  Figure  2.10b  -  Path  Model  with  an  Unmeasured  Variable 

z ^  =  Pzizi  ( RS  a) 

Path  Equations  ^  +  +  (RS  +  a) 

E(z2z1)  =  p91E(z1z1)  +  E(RS  +  a)  z-, 


2111 


Expectations 


E(z;jz1)  =  p3^E(z1z1)  +  p32E(z2z1)  +  E(RS  +  a)Zj^ 
E(z3Z2)  *  p31E^zlz2^  +  p32E(z2z2>  +  E(RS  +  a)Z2 


*2\  ~  p21  aal :  E(RS+a) z^  =  (2. 

Normal  Equations 

r31  =  P31  +  p32r21  +  cal  (2' 

r32  =  p3lr21  +  p32  +  °a2  (2. 


Figure  2.11.  Path  equations,  expectations,  and  normal  equations  for 
Figures  2.10a  and  2.10b. 
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False  Model 

Assumes  ^  =  RS  when  in  fact 
U2  =  RS  +  a 


True  Model 

Assumes  a  is  measured,  and 
thus  U2  is  equal  to  RS 


H  ■  p21zl  +  u2 


Path  Equations 

z2  ‘  p21zl  +  p2aa  +  u2 


Expectations 

^Z2Z1^  ~  P2^E(z^z^)  i  ^(^2^1^  E(z2Zi)-p2^E( z^z]^*P2a^azl  ^"*^U2Z1 ) 

/  / 

E(z2a)=P2^E(z^a)+p2aE(aa)+E(u2a) 


Normal  Equations 

r21  =  P21 

Note:  False  assumption  that 

E(u2Z^)  =  E(RS  +  a)z 


r21  "  p21  +  p2aral 
r2a  =  p21ral  +  p2a 


Bias 


in  Using 


as  Solution  for 


P21 


r21  =  P21  ^alse  model) 

=  p^  +  P2aral  (true  model), 

thus,  P2^  (false  model)  differs  from  p^  (true  model) 
by  a  factor  of: 


p2aral 


(2.12a) 


Figure  2.12.  Biased  solution  of  path  coefficient 
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Solutions  for  Path  Coefficients: 


.40 


2.13a.  Solutions  for  path  coefficients  in  a  self-contained  path  equation 
(true  model).  Asterisks  indicate  correlations. 


Zl 


(-70)*  > 


u2  (=RS  +  a) 


Path  Equation:  Z2  =  ?2lz\  +  u2  ^^S+a) 

Solution  for  Path  Coefficient:  P2^  =  *21  =  *^0 

2.13b.  Solution  for  path  coefficient  in  nonself-contained  path  equation 
(false  model) . 

P21  =  P21  +  p2aral 
.70  =  .50  +  ( . 40 ) ( . 50 ) 

=  .50  +  .20 

Bias  in  p21  =  P2aral  =  *20 
2.13c.  Bias  in  estimate  of  P2^  (false  model). 

Figure  2.13.  Empirical  illustration  of  bias  resulting  from  an  unmeasured 
relevant  cause  and  a  nonself -contained  path  model. 
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Step  I .  Attempt  to  identify  known  major  and  moderate  causes  of  the  endogenous 
variable. 

A.  If  data  have  not  been  collected,  then  attempt  to  measure  the  major/ 
moderate  causes  unless  there  appears  to  be  a  good  reason  not  to 
include  one  or  more  of  these  variables,  as  determined  in  Step  II. 

B.  If  data  have  already  been  collected,  then  attempt  to  identify 
known  major/moderate  unmeasured  causes.  If  one  or  more  such 

causes  is  believed  to  exist,  proceed  to  Step  II.  If  no  major /moderate 
unmeasured  causes  are  believed  to  exist,  then  exit  from  the  decision 
steps  at  this  point  [i.e.,  a  serious  unmeasured  variables  problem 
appears  to  be  unlikely  for  this  endogenous  variable,  at  least  from 
the  perspective  of  the  decision-maker (s) ] . 

Step  II.  Postulate  whether  each  (maj or/moderate)  unmeasured  cause  is  correlated 
with  one  or  more  of  the  measured  causes,  using  prior  empirical  evidence  whenever 
possible  to  support  the  postulates.  In  designing  a  path  analysis  study,  this 
step  and  those  to  follow  are  meant  to  be  viewed  in  terms  of  causes  that  are  not 
as  yet  in  the  causal  model,  as  compared  to  causes  already  included  in  the  model. 

A.  If  the  correlations  between  an  unmeasured  cause  and  all  of  the  measured 
causes  are  presumed  to  be  low  (e.g.,  0  to  ±  .20,  although  this  is 
arbitrary),  then  exit  here  for  that  unmeasured  cause.  Note,  however, 

that  if  a  different  unmeasured  cause  is  included  later  in  the  causal 
model,  then  the  decisions  regarding  prior  unmeasured  causes  should  be 
reevaluated  (this  applies  to  all  of  the  following  steps).  Furthermore, 
an  exit  at  this  point  suggests  that  the  explanatory  power  of  the 
causal  model  in  regard  to  the  endogenous  variable  of  interest  will 
be  reduced.  On  the  other  hand,  if  the  judgment  is  correct  that  all 
correlations  between  the  unmeasured  cause  and  the  measured  causes 
are  low,  then  the  solutions  of  the  path  coefficients  for  the  measured 
causes  are  not  likely  to  be  seriously  biased. 

B.  If  an  unmeasured  cause  is  believed  to  have  a  moderate  to  high 
correlation  with  one  or  more  of  the  measured  causes,  then  consider 
whether  the  unmeasured  cause  is  essentially  linearly  dependent  on 
some  combination  of  the  measured  causes.  If  prior  research  and/or 
judgment  allow  one  to  have  confidence  in  an  affirmative  response  to 
one  of  these  considerations,  then  exit  at  this  point.  Note  again, 
however,  that  while  the  exit  suggests  lack  of  serious  bias,  this  will 
occur  only  if  the  judgments  are  correct. 

Step  III.  By  reaching  Step  III,  it  has  been  decided  that  (a)  at  least  one 
unmeasured  major /moderate  cause  exists  for  the  endogenous  variable  of  interest, 

(b)  the  unmeasured  cause  is  correlated  at  least  moderately  with  one  or  more  of  the 
measured  causes,  and  (c)  the  unmeasured  cause  is  not  linearly  dependent  on  some 
combination  of  the  measured  causes.  This  suggests  that  a  serious  unmeasured  variables 
problem  exists  and  that  an  attempt  to  solve  for  the  path  coefficients  for  this 
endogenous  variable  based  on  the  measured  causes  is  likely  to  result  in  at  least 
one  seriously  biased  solution.  Consequently,  it  is  recommended  that  path  analytic 
procedures  not  oe  employed  for  this  endogenous  variable  until  the  unmeasured 
cause(s)  is  in  fact  measured.  A  less  desirable  possibility  might  be  to  delete 
measured  causes  that  are  presumed  to  be  correlated  with  unmeasured  causes. 


Figure  2.14.  Decision  steps  for  assessing  the  seriousness  of  unmeasured  variables  problem. 

(From  "The  Unmeasured  Variables  Problem  in  Path  Analysis"  by  Lawrence  R.  James 
Journal  of  Applied  Psychology.  1980,  65,  415,421.  Copyright  1980  by  the 
American  Psychological  Association.  Reprinted  by  permission.) 
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Z-,  (Similarity  of 
1  Interests) 


'21 


-»  z2  (Attraction) 


32 


/ 

-►  z3  (Friendship) 


Path  Equations 

z2  —  P21  Z1  +  u2  (2.15a) 

=  P32  z2  L!3  (2.15b) 


2.15a.  z-^  is  an  indirect  cause  of  z^ 


21 


22  Attraction 


Path  Equations 

z2  ■  P21  Z1  +  u2  (2.15c) 

z3=  P31  Z1  +  p32  z2  +  u3  (2 .  i5d) 


2.15b.  z^  is  a  relevant  cause  of  z^ 


Figure  2.15.  Contrasting  path  models  with  treated  as  an 
indirect  cause  and  a  relevant  cause 
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Figure  2.17a.  Path  model  with  P41  predicted  to  be  equal  to  zero. 
z2  =  p21zl  +  u2 
z3  =  p31zl  +  p32z2  +  u3 

z4  =  (p41~0)zl  +  p42z2  +  P43Z3  +  u4  =  p42z2  +  p43z3  +  u4 

Figure  2.17b.  Path  equations. 
r21  =  P21 

r31  =  p32p21  +  P31 

r32  =  p32  +  p  3 1 P  2 1 

r41*  =  p43p32p21  +  p43p31  +  p42p21 

r42*  =  P43P32  +  P43P31P21  +  p42  +^rtp?T 

r43*  =  p43  +  p42p32  +  p42p3lp21  +  +  .2*3*3! 


(2.17a) 

(2.17b) 

(2.17c) 


Figure  2.17c.  Decomposition  equations;  asterisks  denote  correlations 
subject  to  overidentifying  restriction. 


Correlation  Matrix 
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Figure  2.17.  Reproduced  correlation  test  for  a  path  model  with  p^  predicted  to  be  equal 
to  zero  and  two  correlation  matrices. 


FALSE  MODEL 


TRUE 


Path  Equations 

z2  =  p21zl  +  u2 
* 

Z3  —  (P3x=0 ) +  P22z2  u3 


Predictions 


P2^  and  p^  are  significant 

^  k 

p^^  is  not  significant 


MODEL 


Path  Equations 
z2  =  P21  +  u2 

z3  =  (p31=0)zl  +  p32z2  +  p3aza  +  u3 
Predictions 

p2i»  P32'  anc*  P3a  are  significant 
.  * 

p^^  is  not  significant 


Figure  3.1.  Confirmation  of  a  false  model. 
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Table  4.2.  Symbols  used  and  their  meanings  in  notation  for  structural 
equation  models. 


Symbol  Meaning 

£ ;  Latent  exogenous  variable 

n  Latent  endogenous  variable 

e  Latent  disturbance  variable. 


x  Manifest  exogenous  variable. 

y  Manifest  endogenous  variable. 

a  Structural  parameter  relating  one  endogenous 

variable  to  another. 


y  Structural  parameter  relating  an  exogenous 

variable  to  an  endogenous  variable. 

<5  Structural  parameter  relating  a  disturbance 

variable  to  an  endogenous  variable. 

£  Random  vector  of  latent  exogenous  variables. 

x  Random  vector  of  manifest  exogenous  variables. 

e  Random  vector  of  disturbance  variables. 


n 


Random  vector  of  latent  endogenous  variables. 


y 


Random  vector  of  manifest  endogenous  variables. 


_£  * 

n  * 


* 


Random  vector  of  independent  variables. 

<■**  -  [«■'»  e* I- 

w  **■ 

Random  vector  of  dependent  variables, 
n*'  =  [n* ,  y'] . 


■©■I  Ul 
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Table  4.2  (Continued) .  Symbols  used  and  their  meaning  in 
notation  for  structural  equation  models. 


Symbol 


Meaning 


A 

B 

r 

A 

r* 


Alpha  matrix  of  structural  coefficients  for 
endogenous  variables. 

Beta  matrix,  B  =  I_  -  A. 

Gamma  matrix  of  structural  coefficients  relating 
exogenous  to  endogenous  variables. 

Delta  matrix  of  structural  coefficients  relating 
disturbance  variables  to  endogenous  variables. 

Gamma  star  matrix,  F*'  =  [T:A]. 

Phi  matrix  of  covariances  among  independent  variables. 

Selection  matrix  to  select  manifest  dependent 
variables  from  n*. 

Selection  matrix  to  select  manifest  independent 
variables  from  £,* . 

Population  covariance  matrix  for  manifest  variables. 

Hypothetical  population  covariance  matrix  for 
manifest  variables  according  to  a  structural  model. 


Sample  covariance  matrix  for  manifest  variables. 

Estimated  hypothetical  covariance  matrix  for  manifest 
variables  according  to  a  structural  model. 
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Expected  Quality 


Expected  Quantity 


Fig.  4.1a.  Latent  structural  model 


Role  Overload 


State  Anxiety 


£  r  Reduction  — Watte 


e _ »  Reduction— Faulty  Unit* 


e _ -  Increased  inspections 


£  _ k  Increased  Dependability 


Expected 

Quality 


Role 

Overload 


Expected 

Quantity 


Anxiety 


Extended  Hours 


Insufficient  Personnel  ^ e 


Insufficient  Time 


Managerial  Pressure 


^  S— T  Inventory 

Production  Quotas  * -  Expected  State 

Quantity  Anxiety  ^ - 

S.  Gastrointestional 

Headaches  «- 

Fig.  4.1b.  Measurement  model  linking  latent  variables  to  manifest  indicators 

Figure  4.1.  Illustrations  of  a  structural  model  and  a  measurement  model  for  a  latent 
variable  analysis. 
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Figure  4.2.  An  illustration  of  a  latent  variable  model. 
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Expected 

Quantity 


(3) 


Role  Overload 


>  (4)  State  Anxiety 


Fig.  4.3a.  Latent  structural  model  with  path  coefficients  linking 
latent  variables. 
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Production  Quotas 

Fig.  4.3b.  Latent  variable  model  with  one  manifest  indicator  for 
each  latent  variable. 


Increased  P,  „ 
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(3)  Insufficient  - >  (4)  Headaches 

Time 


Fig.  4.3c.  Manifest  variable  model  with  biased  estimates  of  path 
coefficients 


k  Figure  4.3.  Illustration  of  bias  in  estimates  of  path  coefficients  resulting 

from  measurement  error  in  manifest  variables. 


Figure  4.7.  A  structural  model  with  two 
exogenous  variables  and  five  endogenous 
variables.  Cl  is  a  latent  variable  with 
four  indicators,  y^,  . ..,  y,..  ^  a 

manifest  exogenous  variable.  ^  is  a 
manifest  endogenous  variable  caused  by 


standard  errors  of  the  corresponding  parameter 


deleted 


